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METHODS AND COMPOSITIONS FOR SIMULTANEOUS 
SACCHARIFICATION AND FERMENTATION 

Related Information 

This application claims priority to U.S. provisional application number 
60/214,137, entitled "Synergistic Hydrolysis of Carboxymethyl Cellulose and Acid 
Swollen Cellulose by Two Endoglucanases (EGZ and EGY)," filed June 26, 2000, and 
U.S. provisional application number 60/219,913, entitled "Methods and Compositions 
for Simultaneous Saccharification and Fermentation," filed July 21, 2000, both of which 
are incorporated herein in their entirety by this reference. The contents of all patents, 
patent applications, and references cited throughout this specification are hereby 
incorporated by reference in their entireties. 

Government Sponsored Research 

This work was supported, in part, by grants from the U.S. Department of 
Agriculture, National Research Initiative (98-35504-6177 and 98-35505-6976), the U.S. 
Department of Energy, Office of Basic Energy Science (FG02-96ER20222), and the 
Florida Agricultural Experiment Station, University of Florida. 

Background of the Invention 

Many environmental and societal benefits would result from the replacement of 
petroleum-based automotive fuels with renewable fuels obtained from plant materials 
(Lynd et al. 9 (1991) Science 251 .1318-1323; Olson et aL, (1996) Enzyme Microb. 
TechnoL 18:1-17; Wyman^a/., (1995) Amer. Chem. Soc. Symp. 618:272-290). Each 
year, the United States burns over 120 billion gallons of automotive fuel, roughly 
equivalent to the total amount of imported petroleum. The development of ethanol as a 
renewable alternative fuel has the potential to eliminate United States dependence on 
imported oil, improve the environment, and provide new employment (Sheehan, (1994) 
ACS Symposium Series No. 566, ACS Press, pp 1-53). 

In theory, the solution to the problem of imported oil for automotive fuel appears 
quite simple. Rather than using petroleum, a finite resource, ethanol, a renewable 
resource, can be produced efficiently by the fermentation of plant material. Indeed, 
Brazil has demonstrated the feasibility of producing ethanol and the use of ethanol as a 
primary automotive fuel for more than 20 years. Similarly, the United States produces 
over 1.2 billion gallons of fuel ethanol each year. Currently, fuel ethanol is produced 
from corn starch or cane syrup utilizing either Saccharomyces cerevisiae or Zymomonas 
mobilis (Z mobilis). However, neither of these sugar sources can supply the volumes 
needed to realize a replacement of petroleum-based automotive fuels. In addition, both 
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cane sugar and corn starch are relatively expensive starting materials, which have 
competing uses as food products. 

Moreover, these sugar substrates represent only a fraction of the total 
carbohydrates in plants. Indeed, the majority of the carbohydrates in plants are in the 
form of lignocellulose, a complex structural polymer containing cellulose, 
hemicellulose, pectin, and lignin. Lignocellulose is found in, for example, the stems, 
leaves, hulls, husks, and cobs of plants. Hydrolysis of these polymers releases a mixture 
of neutral sugars including glucose, xylose, mannose, galactose, and arabinose. No 
known natural organism can rapidly and efficiently metabolize all of these sugars into 
ethanol. 

Nonetheless, in an effort to exploit this substrate source, the Gulf Oil Company 
developed a method for the production of ethanol from cellulose using a yeast-based 
process termed simultaneous saccharification and fermentation (SSF) (Gauss et al 
(1976) U.S.P.N. 3,990,944). Fungal cellulase preparations and yeasts were added to a 
slurry of the cellulosic substrate in a single vessel. Ethanol was produced concurrently 
during cellulose hydrolysis. However, Gulfs SSF process has some shortcomings. For 
example, fungal cellulases have been considered, thus far, to be too expensive for use in 
large scale bioethanol processes (Himmel et al, (1997) Amer. Chem. Soc. pp. 2-45; 
Ingram etal, (1987) Appl Environ. Microbiol 53:2420-2425; Okamoto et al, (1994) 
Appl Microbiol Biotechnol 42:563-568; Philippidis, G., (1994) Amer. Chem. Soc. pp. 
188-217; Saito etal, (1990) J. Ferment. Bioeng. 69:282-286; Sheehan, J., (1994) Amer. 
Chem. Soc. pp 1-52; Su et al, (1993) Biotechnol Lett. 15:979-984). 

Summary of the Invention 

The development of inexpensive enzymatic methods for cellulose hydrolysis has 
great potential for improving the efficiency of substrate utilization and the economics of 
the saccharification and fermentation process. Accordingly, developing enzymes and, 
preferably, biocatalysts that produce such enzymes which can be used for the efficient 
depolymerization of a complex sugars and subsequent rapid fermentation of the sugar 
into alcohol, would be of great benefit. 

Certain microbes, such as Erwinia chrysanthemi, produce a number of hydrolase 
and lyase enzymes, which are very effective in the degrading of plant tissues containing 
complex sugars. In particular, this organism produces two different endoglucanase 
activities (comprising EGY and EGZ) which have been discovered to function, when 
used in particular amounts, as highly effective enzyme compositions for degrading 
complex sugars. These enzymes may be used as crude extracts having a desired mixture 
of endoglucanase activity or, preferably, may be used as purified compositions. 
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Moreover, a biocatalyst, preferably a recombinant bacterium, more preferably a 
ethanologenic bacterium, can be engineered to express one or more of these enzymatic 
activities in particular amounts sufficient for degrading complex sugars. Such a 
biocatalyst is suitable for the efficient degradation of complex sugars and subsequent 
fermentation into alcohol by a process known as simultaneous saccharification and 
fermentation (SSF). An advantage of the above endoglucanase compositions or 
biocatalysts is that the need for additional fungal cellulases for degrading the complex 
sugars is reduced or eliminated. 

The present invention provides endoglucanase activities for carrying out the 
degrading of a complex sugar and more preferably, the use of endoglucanase activities 
in particular ratios for optimal degrading of a complex sugar. 

In addition, the invention provides recombinant host cells engineered for optimal 
expression and secretion of endoglucanase activities suitable for degrading complex 
sugars. Specifically exemplified are recombinant enteric bacteria, Escherichia and 
Klebsiella, which express an endoglucanase under the transcriptional control of a 
surrogate promoter for optimal expression. In addition, also exemplified is a 
recombinant enteric bacterium that expresses two different endoglucanases eel Y and 
celZ, where each is under the transcriptional control of a surrogate promoter for optimal 
expression in a particular ratio. 

The invention provides for the further modification of these hosts to include a 
secretory protein/s that allow for the increased production and/or secretion of the 
endoglucanases from the cell. In a preferred embodiment, the invention provides for the 
further modification of these hosts to include exogenous ethanologenic genes derived 
from an efficient ethanol producer, such as Zymomonas mobilis. 

Accordingly, these hosts are capable of expressing high levels of proteins that 
may be used alone or in combination with other enzymes or recombinant hosts for the 
efficient production of alcohol from complex sugars. 

More particularly, in a first aspect, the invention provides a composition for 
degrading an oligosaccharide containing, a first endoglucanase having a first degrading 
activity, and a second endoglucanase having a second degrading activity, where the first 
and second degrading activities are present in a ratio such that the degrading of the 
oligosaccharide by the first and second endoglucanases is synergized. 

In a second aspect, the invention provides a method for degrading an 
oligosaccharide comprising, contacting an oligosaccharide with a first endoglucanase 
having a first degrading activity and a second endoglucanase having a second degrading 
activity, where the first and second degrading activities are present in a ratio such that 
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the degrading of the oligosaccharide by the first and second endoglucanases is 
synergized. 

In one embodiment of the above aspects, the contacting of the oligosaccharide 
with the first endoglucanase and the second endoglucanase is performed in any order or 
concurrently. 

In one embodiment of the above aspects, the first endoglucanase or the second 
endoglucanase, or both the first and the second endoglucanases, are derived from a cell 
extract. The cell extract is derived from a bacterial cell, e.g., a bacterial cell that has 
been recombinantly engineered to express the first endoglucanase or the second 
endoglucanase, or both the first and the second endoglucanases. In a related 
embodiment, the bacterial cell is selected from the family Enterobacteriaceae, and 
preferably, is either Escherichia or Klebsiella, and more preferably contains a first 
endoglucanase that is encoded by celZ and a second endoglucanase that is encoded by 
celY, and where celZ and celY are derived from Erwinia. 

In another embodiment of the above aspects, the first endoglucanase is EGZ and 
the second endoglucanase is EGY, preferably in a ratio ranging from about 1:1 to, more 
preferably, about 9:1 to about 19:1. 

In still another embodiment of the above aspects, the first endoglucanase or the 
second endoglucanase, or both the first and the second endoglucanase, are purified. 

In even another embodiment of the above aspects, the degrading of an 
oligosaccharide is synergized by a factor ranging from about 1.1 to about 2.0, and 
preferably by about 1.8. 

In yet another embodiment of the above aspects, the composition contains an 
additional enzyme, e.g., an endoglucanase, exoglucanase, cellobiohydrolase, p- 
glucosidase, endo-l,4-|3-xylanase, a-xylosidase, oc-glucuronidase, a-L- 
arabinofuranosidase, acetylesterase, acetylxylanesterase, a-amylase, P-amylase, 
glucoamylase, pullulanase, p-glucanase, hemicellulase, arabinosidase, mannanase, 
pectin hydrolase, pectate lyase, or a combination thereof. 

In a related embodiment of the above aspects, the additional enzyme is a 
glucanase derived from a fungus, preferably T. longibranchiatum. 

In another related embodiment of the above aspects, the additional enzyme is an 
ethanologenic enzyme, preferably an ethanologenic enzyme such as pyruvate 
decarboxylase or alcohol dehydrogenase. 

In another embodiment of the above aspects, the first endoglucanase and the 
second endoglucanase are packaged separately. 

In another embodiment of the above aspects, the composition is used for 
simultaneous saccharification and fermentation. 
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In still another embodiment of the above aspects, the oligosaccharide is a 
cellooligosaccharide, lignocellulose, hemicellulose, cellulose, pectin, or any 
combination thereof. 

In still another embodiment of the above aspects, the composition or method is, 
respectively, used or conducted in an aqueous solution. 

In a third aspect, the invention provides a recombinant host cell suitable for 
degrading an oligosaccharide containing a first heterologous polynucleotide segment 
encoding a first endoglucanase having a first degrading activity, where the segment is 
under the transcriptional control of a surrogate promoter; and a second heterologous 
polynucleotide segment encoding a second endoglucanase having a second degrading 
activity, where the segment is under the transcriptional control of a surrogate promoter, 
and where the first endoglucanase and the second endoglucanase are expressed so that 
the first and the second degrading activities are present in a ratio such that the degrading 
of the oligosaccharide by the first and second endoglucanases is synergized. 

In one embodiment, the first endoglucanase or the second endoglucanase, or both 
the first and the second endoglucanases are secreted. In a related embodiment, the first 
endoglucanase or the second endoglucanase, or both the first and the second 
endoglucanases, are derived from a cell extract. The cell extract is derived from a 
bacterial cell, e.g., a bacterial cell that has been recombinantly engineered to express the 
first endoglucanase or the second endoglucanase, or both the first and the second 
endoglucanases. In a related embodiment, the bacterial cell is selected from the family 
Enterobacteriaceae, and preferably, is either Escherichia or Klebsiella, and more 
preferably E. coli B, E. coli DH5a, or Klebsiella oxytoca. 

In yet another embodiment of the above aspects, the composition contains an 
additional enzyme, e.g., an endoglucanase, exoglucanase, cellobiohydrolase, P- 
glucosidase, endo-l,4-P-xylanase, oc-xylosidase, oc-glucuronidase, a-L- 
arabinofuranosidase, acetylesterase, acetylxylanesterase, a-amylase, P-amylase, 
glucoamylase, pullulanase, P-glucanase, hemicellulase, arabinosidase, mannanase, 
pectin hydrolase, pectate lyase, or a combination thereof. 

In one embodiment, the first endoglucanase is encoded by celZ and the second 
endoglucanase is encoded by celY, and where celZ and celYare derived from Erwinia. 

In a related embodiment of the above aspects, the additional enzyme is a 
glucanase derived from a fungus, preferably T. longibranchiatum. 

In another related embodiment, the first endoglucanase is EGZ and the second 
endoglucanase is EGY. 

In another related embodiment, the additional enzyme is a secretory enzyme, 
preferably a pul or out gene product. 
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In anotn^r related embodiment of the above aspects, the host cell is 
ethanologenic, e^kE. coli K04 (ATCC 55123); E. coli KOI 1 (ATCC 55124), E. coli 

KOI 2 (ATCC 55 12^1 and E. coli LY01 (ATCC ), K. oxytoca M5A1, and K. 

oxytoca P2 (ATCC 553^7). 

In a fourth aspect, the invention provides a method for enhancing the degradation 
of an oligosaccharide by contacting an oligosaccharide with a host cell containing, a first 
heterologous polynucleotide segment encoding a first endoglucanase having a first 
degrading activity, where the segment is under the transcriptional control of a surrogate 
promoter; and a second heterologous polynucleotide segment containing a sequence 
encoding a second endoglucanase having a second degrading activity, where the 
segment is under the transcriptional control of a surrogate promoter. The method further 
provides that the first endoglucanase and the second endoglucanase are expressed so that 
the first and the second degrading activities are present in a ratio such that the degrading 
of the oligosaccharide by the first and second endoglucanases is synergized and thereby 
enhanced. 

In one embodiment of the above aspect, the first endoglucanase or the second 
endoglucanase or both the first and the second endoglucanases are secreted. 

In another embodiment, the host cell of the above method is ethanologenic. 

In another embodiment, the method is conducted in an aqueous solution. 

In even another embodiment, the method is used for simultaneous 
saccharification and fermentation. 

In yet another embodiment, the method includes degrading an oligosaccharide 
selected from the group consisting of cellooligosaccharide, lignocellulose, 
hemicellulose, cellulose, pectin, or any combination thereof. 

In a fifth aspect, the invention provides a method of making a recombinant host 
cell suitable for degrading an oligosaccharide by introducing into the host cell a first 
heterologous polynucleotide segment encoding a first endoglucanase having a first 
degrading activity, where the segment is under the transcriptional control of a surrogate 
promoter; and a second heterologous polynucleotide segment containing a sequence 
encoding a second endoglucanase having a second degrading activity, where the 
segment is under the transcriptional control of a surrogate promoter. The method further 
provides that the first and second endoglucanases are expressed such that the first and 
the second degrading activities are present in a ratio such that the degrading of the 
oligosaccharide by the first and second endoglucanases is synergized. 

In one embodiment of the above aspect, the first endoglucanase or the second 
endoglucanase or both the first and second endoglucanases are secreted. 

In another embodiment, the host cell is ethanologenic. 
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In even another embodiment, the first endoglucanase is encoded by celZ and the 
second endoglucanase is encoded by celY, and celZ and celY are derived from Erwinia. 

In still another embodiment, the surrogate promoter of the first heterologous 
polynucleotide segment or the second heterologous polynucleotide segment or both the 
5 first and second polynucleotide segments, contains a polynucleotide fragment derived 
from Zymomonas mobilis. 

In yet another embodiment, the recombinant host cell is suitable for 
simultaneous saccharification and fermentation, and preferably, is ethanologenic. 

In a sixth aspect, the invention provides a method for making a recombinant host 
10 cell integrant by introducing into the host cell a vector containing the polynucleotide 
sequence of pLOI2352 (SEQ ID NO: 17) and identifying a host cell having the vector 
stably integrated. 

In a seventh aspect, the invention provides a method for expressing a 
endoglucanase in a host cell by introducing into the host cell a vector containing the 
15 polynucleotide sequence of pLOI2306 (SEQ ID NO: 12) and identifying a host cell 
expressing the endoglucanase. 

In an eighth aspect, the invention provides a method for producing ethanol from 
an oligosaccharide source by contacting the oligosaccharide source with a ethanologenic 
host cell containing a first heterologous polynucleotide segment encoding a first 
20 endoglucanase having a first degrading activity, where the segment is under the 
transcriptional control of a surrogate promoter; and a second heterologous 
polynucleotide segment encoding a second endoglucanase having a second degrading 
activity, where the segment is under the transcriptional control of a surrogate promoter. 
The method further provides that the first and second endoglucanases are expressed so 
25 that the first and the second degrading activities are present in a ratio such that the 

degrading of the oligosaccharide by the first and second endoglucanases is synergized 
resulting in a degraded oligosaccharide that is fermented into ethanol. 

In one embodiment, the first endoglucanase is encoded by celZ and the second 
endoglucanase is encoded by ce/Fgene, and celZ and celY are derived from Erwinia. 
30 In another embodiment, the host cell further contains a heterologous 

polynucleotide segment encoding at least one pul gene or out gene. 

In even ^N? other embodiment, the host cell is selected from the family 
Enterobacteriaceae^preferably Escherichia or Klebsiella, more preferably E. coli K04 
(ATCC 55123), E. cdfc KOll (ATCC 55124), E. coli K012 (ATCC 55125), LY01 

35 (ATCC ), K. oxytoca M5A1, or K. oxytoca P2 (ATCC 55307). 

In another embodiment, the method is conducted in an aqueous solution. 
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In still another embodiment, the oligosaccharide is selected from the group 
consisting of cellooligosaccharide, lignocellulose, hemicellulose, cellulose, pectin, and 
any combination thereof. 

In another embodiment, the heterologous polynucleotide segment is, or derived 
from, pLOI 2352 (SEQ ID NO: 17). 

In yet another embodiment, the first endoglucanase is EGZ and the second 
endoglucanase is EGY. 

In another embodiment, the surrogate promoter of the first polynucleotide 
segment or the second polynucleotide segment, or both the first and the second 
polynucleotide segments contains a polynucleotide fragment derived from Zymomonas 
mobilis. 

In ninth aspect, the invention provides a vector containing the polynucleotide 
sequence of a plasmid, or fragment thereof, of pLOI231 1, pLOI1620, pLOI2316, 
pLOI2317, pLOI2318, pLOI2319, pLOI2320, pLOI2323, pLOI2342, pLOI2348, 
pLOI2349, pLOI2350, pLOI2352, pLOI2353, pLOI2354, pLOI2355, pLOI2356, 
pLOI2357, pLOI2358, or pLOI2359. 

In a tenth aspect, the invention provides a host cell containing a vector having 
the polynucleotide sequence of a plasmid, of fragment thereof, of pLOI23 1 1 , pLOI1620, 
pLOI2316, pLOI2317, pLOI2318, pLOI2319, pLOI2320, pLOI2323, pLOI2342, 
pLOI2348, pLOI2349, pLOI2350, pLOI2352, pLOI2353, pLOI2354, pLOI2355, 
pLOI2356, pLOI2357, pLOI2358, or pLOI2359. 

In one embodiment, the host is Klebsiella oxytoca strain P2 (pCPP2006), 
Klebsiella oxytoca strain SZ6 (pCPP2006), Klebsiella oxytoca strain SZ21 (pCPP2006), 
or Klebsiella oxytoca strain SZ22 (pCPP2006). 

In an eleventh aspect, the invention provides a method for degrading an 
oligosaccharide by obtaining a first endoglucanase having a first degrading activity, 
obtaining a second endoglucanase having a second degrading activity, and contacting an 
oligosaccharide with the first and second endoglucanases, where the first and second 
degrading activities are present in a ratio such that the degrading of the oligosaccharide 
by the first and second endoglucanases is synergized. 

In a twelfth aspect, the invention provides a method for enhancing the degrading 
of an oligosaccharide by contacting an oligosaccharide with a first endoglucanase 
having a first degrading activity and a second endoglucanase having a second degrading 
activity, where the first and second degrading activities are present in a ratio such that 
the degrading of the oligosaccharide by the first and second endoglucanases is 
synergized and thereby enhanced. 
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In a related aspect, the invention provides a method for degrading, and/or for enhancing the 
degrading of, an oligosaccharide by contacting an oligosaccharide with a first endoglucanase having a 
first degrading activity and a second endoglucanase having a second degrading activity, where the first 
and second degrading activities are present in a ratio such that the degrading of the oligosaccharide by 
the first and second endoglucanases results in a change in viscosity, preferably, a reduction in viscosity, 
more preferably by an amount of at least , e.g., 5 centopoise, 10 centopoise, 20 centopoise, 50 
centopoise, 100 centopoise, 500 centopoise, or 1000 centopoise or more, or within a range thereof. 

In a preferred embodiment, the oligosaccharide is cellulose, e.g., amorphous cellulose or 
crystalline cellulose and may be from a source such as, e.g., paper, pulp, or plant fiber. 

In a thirteenth aspect, the invention provides a recombinant host cell suitable for 
degrading an oligosaccharide containing a first heterologous polynucleotide segment encoding a 
first endoglucanase; and a second heterologous polynucleotide segment encoding a second 
endoglucanase. 

Ina related aspect, the host cell is suitable for reducing the viscosity of an 
oligosaccharide by comprising a first heterologous polynucleotide segment encoding a first 
endoglucanase; and a second heterologous polynucleotide segment encoding a second 
endoglucanase. 

In one embodiment, the first heterologous polynucleotide segment is under the 
transcriptional control of a surrogate promoter, and the second heterologous polynucleotide 
segment is under the transcriptional control of a surrogate promoter. 

In another embodiment, the cell is a bacterial cell, preferably selected from the family 
Enterobacteriaceae, more preferably from the genus Escherichia or Klebsiella. 

In another embodiment, the first endoglucanase is encoded by celZ and the second 
endoglucanase is encoded by celY, and celZ and celY are derived from Erwinia. 

In another embodiment, the first endoglucanase is EGZ and said second endoglucanase 

is EGY. 

In a fourteenth aspect, the invention provides a recombinant host strain of Klebsiella 
oxytoca strain P2 (pCPP^006) represented by a deposit with the American Type Culture 

Collection designated as deposit number ATCC . 

In a fifteenth aspect, the invention provides a recombinant host strain of Klebsiella 
oxytoca strain SZ6 (pCPP2006) represented by a deposit with the American Type Culture 

Collection designated as deposit number ATCC . 

In a sixteenth aspect, the invention provides a recombinant host strain of Klebsiella 
oxytoca strain SZ2 1 (pCPP2006) represented by a deposit with the American Type Culture 
Collection designated as deposit number A rfC . 
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In a seventeenth aspect, the invention provides a recombinant host strain of 
Klebsiella oxytoca strain SZ22 (pCPP2006) represented by a deposit with the American 

Type Culture CollectiWi designated as deposit number ATCC . 

In a eighteenth aspect, the invention provides a recombinant cell containing a 
first heterologous polynucleotide segment encoding a first endoglucanase; and a second 
heterologous polynucleotide segment encoding a second endoglucanase, where the first 
polynucleotide segment encoding a first endoglucanase or the second polynucleotide 
segment encoding a second endoglucanase, or both are sufficiently homologous in an 
amino acid alignment to either the gene product of celY or celZ from Erwinia as to share 
the functional activity of being capable of degrading a polysaccharide. 

In nineteenth aspect, the invention provides an extract or composition derived 
from a host cell of the invention, e.g., a secreted polypeptide, a lysate or broth, or a pure, 
semi-pure, or unpurified enzymatic extract or polypeptide which is suitable for 
degrading and/or reducing the viscosity of a oligosaccharide when contacted thereto. 

Other features and advantages of the invention will be apparent from the 
following detailed description and claims. 

Brief Description of the Drawings 

Figure 1 shows fermentation rates for the ethanologenic recombinant host E. 
coli KOI 1 using rice hull substrates pretreated with dilute acid and supplemented with 
two different media. 

Figure 2 shows simultaneous saccharification and fermentation (SSF) rates for 
the ethanologenic recombinant host strain K. oxytoca P2 using mixed waste office paper. 
Insoluble residues from SSF were recycled as a source of bound cellulase enzymes and 
substrate during subsequent fermentations. 

Figure 3 shows the structure of the plasmid pLOI21 71 , a low copy promoter 
probe vector showing the orientation of the kanamycin resistance gene (kari) for 
selection, the temperature sensitive pSClOl replicon (Rep(ts)) for episomal maintenance 
of the plasmid, and the promoterless polysaccharase gene celZ encoding phospho-beta- 
glucosidase (EGZ). 

Figure 4 is a graph showing the high correspondence between the size of the 
zone of clearance on CMC indicator plates (x-axis) measured for a transformed bacterial 
colony and the amount of glucanase activity expressed (y-axis). 

Figure 5 shows the partial nucleotide sequence (SEQ ID NO: 1) of the Z mobilis 
DNA fragment in the pLOI2183 plasmid that functions as a surrogate promoter. The 
full sequence has been assigned GenBank Accession Number AF 109242 (SEQ ID NO: 
2). Indicated are two transcriptional start sites (#) , -35 and -10 regions, the Shine- 
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Delgarno site (bold), partial vector and celZ sequence (lowercase), and the celZ start 
codon (atg indicated in bold). 

Figure 6 represents electron micrographs of E. coli DH5oc cells harboring 
different plasmids expressing little if any (pUC19; panel A), moderate (pLOI2164; panel 
B), and high levels (pLOI2307; panel C) of glucanase in the form of periplasmic 
inclusion bodies (pib) localized between the outer cell wall and the inner membrane 
(im). The bar shown represents 0.1 \xm. 

Figure 7 shows a schematic detailing the cloning strategy used to construct the 
celZ integration vector pLOI2306, a genetic construct capable of being introduced into 
the genome of a recombinant host and conferring stable glucanase expression activity to 
the host. 

Figure 8 shows a schematic representation of the celZ integration vector 
pLOI2306 (SEQ ID NO: 12) with the locations of the surrogate promoter from Z 
mobilis, the celZ gene from E. chrysanthemi, resistance markers (bla and tef), and K. 
oxytoca target sequence indicated. 

Figure 9 shows graphical depictions of the synergistic action of EGY and EGZ. 
Both enzymes were diluted to equal CMCase activities (1.5 IU/ml) with calculated 
synergies shown in parenthesis. Panel A shows a graph depicting the effect of enzyme 
ratios on synergy. Different amounts of EGY and EGZ were combined to maintain a 
constant predicted activity (0.15 IU/ml) based on the contribution of individual 
enzymes. Assays were incubated with CMC for 1 hour at 35°C and terminated by 
boiling. Numbers on the X axis indicate the proportions of EGZ and EGY. Synergy is 
shown above each bar. Panel B shows a graph depicting the hydrolysis of CMC by EGZ 
and EGY, alone and in combination (9 parts EGZ + 1 part EGY). All assays contained 
equal total activities (0.15 IU/ml) based on the sum of individual EGY and EGZ 
activities. Synergy is shown above each point for the combination of both enzymes. 
Panel C shows a graph depicting the hydrolysis of acid-swollen cellulose by EGZ and 
EG Y, alone and in combination. A 9 to 1 ratio of EGZ to EGY was used for the 
combined enzyme reaction. All assays contained 1.5 IU/ml based on the sum of 
individual EGY and EGZ activities. Synergy is shown above each point for the 
combination of both enzymes. 

Figure 10 represents a thin layer chromatography (TLC) analysis of the 
hydrolysis products from two complex sugars: acid- swollen cellulose and Avicel®. 
Approximately 1 .5 IU and 25 IU of CMCase were used in reactions with acid-swollen 
cellulose and Avicel®, respectively. Abbreviations for Y axis: Gl 5 glucose; G2, 
cellobiose; G3, cellotriose; G4, cellotetraose; and G5, cellopentaose. Lanes: S, mixed 
cellooligosaccharide standard; C 5 control lacking enzyme; Z, EGZ; Y, EGY, and Z + Y: 
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EGZ + EGY. Panel A shows results with acid-swollen cellulose after a 6-hour 
incubation with CMCase (1 \xl loading). Panel B shows results with acid-swollen 
cellulose after a 6-hour incubation with CMCase (2 |il loading). Panel C shows results 
with Avicel® after a 48-hour incubation with CMCase (10 |al loading). 

Figure 11 represents a TLC analysis showing the hydrolysis of 
cellooligosaccharides by EGZ and EGY. Each test contained approximately 0.07 IU of 
CMCase per ml (2 hour incubation, 35°C). Abbreviations: S, mixed 
cellooligosaccharides standard; Gl, glucose; G2 5 cellobiose; G3, cellotriose; G4 ? 
cellotetraose; and G5, cellopentaose. Panel A shows the substrates before hydrolysis, 
Panel B shows the substrates after incubation with EGY, Panel C shows the substrates 
after incubation with EGZ, and Panel D shows EGZ hydrolysis of cellopentaose after 
different periods of incubation (0, 5, 10, and 25). 

Figure 12 is a model illustrating the utilization of amorphous cellulose by E. 
chrysantheml Three glucosidases are used for the catabolism of amorphous cellulose. 
Two of these, EGY and EGZ are extracellular endoglucanases, which function, together 
in a synergistic fashion. EGY requires large substrate molecules and hydrolyzes these 
into shorter, insoluble fragments. EGY does not hydrolyze soluble 
cellooligosaccharides (2 to 5 glucosyl residues). EGZ readily hydrolyzes soluble 
cellooligosaccharides (cellopentaose and cellotetraose) and amorphous fragments of 
intermediate length to produce cellobiose and cellotriose. Cellobiose and cellotriose are 
phosphorylated during cellular uptake by a phosphoenolpyruvate-dependent 
phosphotransferase system. Hydrolysis is completed intracellularly by a third enzyme, 
phospho-P-glucosidase. Resulting monomeric products (glucose and glucose-6- 
phosphate) are metabolized by glycolysis. 

Figure 13 is a schematic representation of the construction of a promoter-probe 
vector for celY. Sau3A\ fragments of Z mobilis chromosomal DNA were ligated into 
the BamHI site of pLOI23 17 to provide a strong, surrogate promoter for celY coding 
region (solid segments). Z mobilis DNA fragments (promoter 1 and promoter 2) are 
shown as open segments. Replicons and antibiotic resistance genes are stippled; other 
vector DNA is shown as thin connecting lines. Arrows indicate direction of 
transcription. 

Figure 14 is a depiction of transcriptional initiation sites and putative promoter 
regions for the celY promoter in DH5a (pLOI2323). 

Figure 15 is a schematic representation of the construction of pLOI2352 for the 
functional integration of celYand celZ into the chromosome of ethanologenic K. oxytoca 
P2. Coding regions for celY and celZ are shown as solid segments. Fragments of Z 
mobilis DNA that serve as promoters (proml and prom2) are shown as open segments. 
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Replicons and antibiotic resistance genes are stippled; other vector DNA is shown as a 
thin connecting line. Arrows on segments indicate the direction of transcription. The 
small open arrows represent the FRT sites, which are recognized by theflp recombinase. 
FRT sequences are asymmetrical and arranged to allow the deletion of plasmid DNA 
(replicon and selectable marker) after chromosomal integration. 

Figure 16 is a schematic representation of the construction of pLOI2357 for the 
inactivation of celZ by double homologous recombination. Coding regions for eel Y arc 
shown as solid segments. The fragment of Z mobilis DNA that serves as a promoter 
(proml) is shown as open segments. Replicons and antibiotic resistance genes are 
stippled; other vector DNA is shown as a thin connecting line. Arrows on segments 
indicate the direction of transcription. The small open arrows represent the FRT sites, 
which are recognized by the flp recombinase. 

Figure 77 shows a digital image of a thin layer chromatogram of fermentation 
broth illustrating the utilization of cellobiosides. The left panel (A) represents strain 
P2(pCPP2006), the parent lacking E. chrysanthemi endoglucanases, whereas the right 
panel (B) represents strain SZ2 1 (pCPP2006), the recombinant secreting high levels of 
CelY and CelZ endoglucanases. Approximately 4 \\L of broth was spotted in each lane 
and labels Gl through G6 refer to the number of glucosyl residues in the adjacent spots. 
Lanes for both panels (A and B) are from left to right: 1, initial (0 h); 2, after 10 h; and 
3, after 36 h of fermentation. 

Figure 18 shows a graphical depiction of the amount of ethanol production from 
amorphous cellulose by ethanologenic derivatives of K. oxytoca M5A1. Results shown 
in the top and middle panel (A and B) include standard deviations from three replicates; 
results shown in the bottom panel (C) represent single fermentations. The top panel (A) 
shows results from the fermentation of 6.85 g/L amorphous cellulose; the middle panel 
(B) shows results from the fermentation of 15.33 g/L amorphous cellulose; and the 
bottom panel (C) shows the fermentation of 28.96 g/L amorphous cellulose. Strain 
P2(pCPP2006) is the parental strain and lacks E. chrysanthemi endoglucanase genes. 
Strain SZ6(pCPP2006) secretes CelZ endoglucanase; strain SZ21(pCPP2006) secretes 
CelY and CelZ endoglucanases; strain SZ22(pCPP2006) secretes CelY endoglucanase. 
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Detailed Description of the Invention 

In order for the full scope of the invention to be clearly understood, the following 
definitions are provided. 

L Definitions 

As used herein the term "recombinant host" is intended to include a cell suitable 
for genetic manipulation, e.g., which can incorporate heterologous polynucleotide 
sequences, e.g., which can be transfected. The cell can be a microorganism or a higher 
eukaryotic cell. The term is intended to include progeny of the cell originally 
transfected. In preferred embodiments, the cell is a bacterial cell, e.g., a Gram-negative 
bacterial cell, and this term is intended to include all facultatively anaerobic Gram- 
negative cells of the family Enterobacteriaceae such as Escherichia, Shigella, 
Citrobacter, Salmonella, Klebsiella, Enter obacter, Erwinia, Kluyvera, Serratia, 
Cedecea, Morganella, Hafnia, Edwardsiella, Providencia, Proteus, and Yersinia. 
Particularly preferred recombinant hosts are Escherichia coli or Klebsiella oxytoca cells. 

The term "ratio" is intended to include the relationship between the amounts 
(measured, e.g., by activity or in moles) of two enzymes in a predetermined combination 
where, preferably, the ratio is not naturally occurring and, more preferably, results in 
synergistic enzyme activity. 

The terms "a first endoglucanase having a first degrading activity" and "a second 
endoglucanase having a second degrading activity" are intended to include, respectively, 
an endoglucanase with an activity that can be distinguished from another endoglucanase 
(e.g., a second endoglucanase) with a second activity functionally (e.g., by its activity on 
a particular substrate; synergism with another enzyme), by source of origin (e.g., host 
cell strain, including naturally occurring strains or genetically modified strains 
expressing a clone expressing an endoglucanase), or by biochemical properties using art 
recognized techniques (e.g., molecular weight determination or purification 
characteristics). The degrading activity, e.g., the enzymatic hydrolysis of an 
oligosaccharide, can also comprise a change in the viscosity of the oligosaccharide. 

The terms "synergism " "synergistic activity," and "synergized" are intended to 
describe the interaction between distinguishable polypeptides or polypeptide activities 
wherein the effect of the total activity of the polypeptides taken together are greater than 
the sum of the effects of the individual activities. The polypeptides may be, for 
example, endoglucanases, exoglucanases, cellobiohydrolases, p-glucosidases, endo-1,4- 
P-xylanases, oc-xylosidases, a-glucuronidases, a-L-arabinofuranosidases, 
acetylesterases, acetylxylanesterases, oc-amylases, p -amylases, glucoamylases, 
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pullulanases, p-glucanases, hemicellulases, arabinosidases, mannanases, pectin 
hydrolases, pectate lyases, or any combination thereof. An activity of a polypeptide 
includes the degradation (e.g., hydrolysis) of an oligosaccharide but may also include a 
change in the viscosity of the oligosaccharide. The synergized degrading of an 
oligosaccharide is preferably by a factor of about 1.1, 1.2,. 1.3, 1.4, 1.5, 1.6, 1.7, 1.8, 1.9, 
or more preferably about 2.0, with a typical factor being about 1.8. 

The term "heterologous polynucleotide segment" is intended to include a 
polynucleotide segment that encodes one or more polypeptides or portions or fragments 
of polypeptides. A heterologous polynucleotide segment may be derived from any 
source, e.g., eukaryotes, prokaryotes, virii, or synthetic polynucleotide fragments. 

The terms "polysaccharase," "cellulase," or "glucanase" are used 
interchangeably herein and are intended to include a polypeptide capable of catalyzing 
the degradation or depolymerization of any linked sugar moiety, e.g., disaccharides, 
trisaccharides, oligosaccharides, including, complex carbohydrates, also referred to 
herein as complex sugars, e.g., cellooligosaccharide and lignocellulose, which comprises, 
cellulose, hemicellulose, and pectin. The terms are intended to include cellulases such 
as glucanases, including preferably, endoglucanases but also including, e.g., 
exoglucanase, p-glucosidase, cellobiohydrolase, endo-l,4-p-xylanase, p-xylosidase, a- 
glucuronidase, a-L-arabinofuranosidase, acetylesterase, acetylxylanesterase, oc-amylase, 
p-amylase, glucoamylase, pullulanase, p-glucanase, hemicellulase, arabinosidase, 
mannanase, pectin hydrolase, pectate lyase, or a combination of any of these cellulases. 

The term "endoglucanase" is intended to include a cellulase which typically 
hydrolyses internal pi -4 glucosyl linkages in polymeric substrates and does not 
preferentially hydrolyze linkages located at the ends of the chain. 

The term "surrogate promoter" is intended to include a polynucleotide segment 
that can transcriptionally control a gene-of-interest that it does not transcriptionally 
control in nature. In a preferred embodiment, the transcriptional control of a surrogate 
promoter results in an increase in expression of the gene-of-interest. In a preferred 
embodiment, a surrogate promoter is placed 5' to the gene-of-interest. A surrogate 
promoter may be used to replace the natural promoter, or may be used in addition to the 
natural promoter. A surrogate promoter may be endogenous with regard to the host cell 
in which it is used or it may be a heterologous polynucleotide sequence introduced into 
the host cell, e.g., exogenous with regard to the host cell in which it is used. Other 
promoters suitable for use in bacteria include, e.g., lacZ, T7, and SP6 (see, e.g., Ausubel 
et al. infra). 

The terms "oligosaccharide source," "oligosaccharide," "complex cellulose," 
"complex carbohydrate," and "complex sugar," and "polysaccharide" are used 
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essentially interchangeably and are intended to include any carbohydrate source 
comprising more than one sugar molecule. These carbohydrates may be derived from 
any unprocessed plant material or any processed plant material. Examples are wood, 
paper, pulp, plant derived fiber, or synthetic fiber comprising more than one linked 
carbohydrate moiety, i.e., one sugar residue. One particular oligosaccharide source is 
lignocellulose, which represents approximately 90% of the dry weight of most plant 
material and contains carbohydrates, e.g., cellulose, hemicellulose, pectin, and aromatic 
polymers, e.g., lignin. Cellulose makes up 30%-50% of the dry weight of lignocellulose 
and is a homopolymer of cellobiose (a dimer of glucose). Similarly, hemicellulose, 
makes up 20%-50% of the dry weight of lignocellulose and is a complex polymer 
containing a mixture of pentose (xylose, arabinose) and hexose (glucose, mannose, 
galactose) sugars which contain acetyl and glucuronyl side chains. Pectin makes up 1%- 
20% of the dry weight of lignocellulose and is a methylated homopolymer of glucuronic 
acid. Other oligosaccharide sources include carboxymethyl cellulose (CMC), 
amorphous cellulose (e.g., acid-swollen cellulose), and the cellooligosaccharides 
cellobiose, cellotriose, cellotetraose, and cellopentaose. Cellulose, e.g., amorphous 
cellulose may be derived from a paper or pulp source (including, e.g., fluid wastes 
thereof) or, e.g., agricultural byproducts, e.g., corn stalks, soybean solubles, or beet pulp. 
Any one or a combination of the above carbohydrate polymers are potential sources of 
sugars for depolymerization and subsequent bioconversion to ethanol by fermentation 
according to the products and methods of the present invention. 

The term "gene/s" or "polynucleotide segment" is intended to include nucleic 
acid molecules, e.g., polynucleotides which include an open reading frame encoding a 
polypeptide, and can further include non-coding regulatory sequences, and introns. In 
addition, the terms are intended to include one or more genes that map to a functional 
locus, e.g., the out or pul genes of Erwinia and Klebsiella, respectively, that encode 
more than one gene product, e.g., a secretory polypeptide. In addition, the terms are 
intended to include a specific gene for a selected purpose. The gene may be endogenous 
to the host cell or may be recombinantly introduced into the host cell, e.g., as a plasmid 
maintained episomally or a plasmid (or fragment thereof) that is stably integrated into 
the genome. In a preferred embodiment, the gene of polynucleotide segment is involved 
in at least one step in the bioconversion of a carbohydrate to ethanol. Accordingly, the 
term is intended to include any gene encoding a polypeptide such as an alcohol 
dehydrogenase, a pyruvate decarboxylase, a secretory protein/s, or a polysaccharase, 
e.g., a glucanase, such as an endoglucanase or exoglucanase, a cellobiohydrolase, P- 
glucosidase, endo-l,4-p-xylanase, P-xylosidase, a-glucuronidase, ot-L- 
arabinofuranosidase, acetylesterase, acetylxylanesterase, oc-amylase, P-amylase, 
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glucoamylase, pullulanase, p-glucanase, hemicellulase, arabinosidase, mannanase, 
pectin hydrolase, pectate lyase, or a combination thereof. 

The term "simultaneous saccharification and fermentation" or "SSF" is intended 
to include the use of one or more recombinant hosts (or extracts thereof, including 
purified or unpurified extracts) for the contemporaneous degradation or 
depolymerization of a complex sugar and bioconversion of that sugar residue into 
ethanol by fermentation. 

The term "transcriptional control" is intended to include the ability to modulate 
gene expression at the level of transcription. In a preferred embodiment, transcription, 
and thus gene expression, is modulated by replacing or adding a surrogate promoter near 
the 5' end of the coding region of a gene-of-interest thereby resulting in altered gene 
expression. In a most preferred embodiment, the transcriptional control of one or more 
gene is engineered to result in the optimal expression of such genes, e.g., in a desired 
ratio. The term also includes inducible transcriptional control as recognized in the art. 

The term "expression" is intended to include the expression of a gene at least at 
the level of mRNA production. 

The term "expression product" is intended to include the resultant product of an 
expressed gene, e.g., a polypeptide. 

The term "increased expression" is intended to include an alteration in gene 
expression at least at the level of increased mRNA production and preferably, at the 
level of polypeptide expression. 

The term "increased production" is intended to include an increase in the amount 
of a polypeptide expressed, in the level of the enzymatic activity of the polypeptide, or a 
combination thereof. 

The terms "activity" and "enzymatic activity" are used interchangeably and are 
intended to include any functional activity normally attributed to a selected polypeptide 
when produced under favorable conditions. The activity of an endoglucanase (e.g., 
EGY or EGZ) is, for example, the ability of the polypeptide to enzymatically 
depolymerize a complex saccharide. Typically, the activity of a selected polypeptide 
encompasses the total enzymatic activity associated with the produced polypeptide. The 
polypeptide produced by a host cell and having enzymatic activity may be located in the 
intracellular space of the cell, cell-associated, secreted into the extracellular milieu, or a 
combination thereof. Techniques for determining total activity as compared to secreted 
activity are described herein and are known in the art. 

The term "secreted" is intended to include an increase in the secretion of a 
polypeptide into the periplasmic space or into the extracellular milieu, e.g., a 
heterologous polypeptide, preferably a polysaccharase. Typically, the polypeptide is 
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secreted at an increased level that is in excess of the naturally-occurring amount of 
secretion. More preferably, the term "secreted" refers to an increase in secretion of a 
given polypeptide that is at least 10% and more preferably, at least about 100%, 200%, 
300,%, 400%, 500%, 600%, 700%, 800%, 900%, 1000%, or more, as compared to the 
naturally-occurring level of secretion. 

The term "secretory polypeptide" is intended to include any polypeptide/s, alone 
or in combination with other polypeptides, that facilitate the transport of another 
polypeptide from the intracellular space of a cell to the extracellular milieu. In one 
embodiment, the secretory polypeptide/s encompass all the necessary secretory 
polypeptides sufficient to impart secretory activity to a Gram-negative host cell. 
Typically, secretory proteins are encoded in a single region or locus that may be isolated 
from one host cell and transferred to another host cell using genetic engineering. In a 
preferred embodiment, the secretory polypeptide/s are derived from any bacterial cell 
having secretory activity. In a more preferred embodiment, the secretory polypeptide/s 
are derived from a host cell having Type II secretory activity. In another more preferred 
embodiment, the host cell is selected from the family Enterobacteriaceae. In a most 
preferred embodiment, the secretory polypeptide/s are one or more gene products of the 
out or pul genes derived from, respectively, Erwinia or Klebsiella. Moreover, the 
skilled artisan will appreciate that any secretory protein/s derived from a related host 
that is sufficiently homologous to the out or pul gene/s described herein may also be 
employed (Pugsley et aL, (1993) Microbiological Reviews 57:50-108; Lindeberg et aL, 
(1996) Mol Micro. 20:175-190; Lindeberg et aL, (1992) J. of Bacteriology 174:7385- 
7397; He etaL, (1991) Proc. Natl Acad Set USA, 88:1079-1083). 

The term "derived from" is intended to include the isolation (in whole or in part) 
of a polynucleotide segment from an indicated source or the purification of a 
polypeptide from an indicated source. The term is intended to include, for example, 
direct cloning, PCR amplification, or artificial synthesis from, or based on, a sequence 
associated with the indicated polynucleotide source. 

The term "ethanologenic" is intended to include the ability of a microorganism 
to produce ethanol from a carbohydrate as a primary fermentation product. The term 
includes but is not limited to naturally occurring ethanologenic organisms, organisms 
with naturally occurring or induced mutations, and organisms that have been genetically 
modified. 

The term "Gram-negative bacteria" is intended to include the art recognized 
definition of this term. Typically, Gram-negative bacteria include, for example, the 
family Enterobacteriaceae which comprises, among others, the species Escherichia and 
Klebsiella, 
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The term "sufficiently homologous" is intended to include a first amino acid or 
nucleotide sequence which contains a sufficient or minimum number of identical or 
equivalent amino acid residues or nucleotides, e.g., an amino acid residue which has a 
similar side chain, to a second amino acid or nucleotide sequence such that the first and 
second amino acid or nucleotide sequences share common structural domains and/or a 
common functional activity. For example, amino acid or nucleotide sequences which 
share common structural domains have at least about 40% homology, preferably 50% 
homology, more preferably 60%, 70%, 80%, or 90% homology across the amino acid 
sequences of the domains and contain at least one, preferably two, more preferably 
three, and even more preferably four, five, or six structural domains, are defined herein 
as sufficiently homologous.- Furthermore, amino acid or nucleotide sequences which 
share at least 40%, preferably 50%, more preferably 60%, 70%, 80%, or 90% homology 
and share a common functional activity are defined herein as sufficiently homologous. 

In one embodiment, two polynucleotide segments, e.g., promoters, are 
"sufficiently homologous" if they have substantially the same regulatory effect as a 
result of a substantial identity in nucleotide sequence. Typically, "sufficiently 
homologous" sequences are at least 50%, more preferably at least 60%, 70%, 80%, or 
90% identical, at least in regions known to be involved in the desired regulation. More 
preferably, no more than five bases differ. Most preferably, no more than five 
consecutive bases differ. 

To determine the percent identity of two polynucleotide segments, or two amino 
acid sequences, the sequences are aligned for optimal comparison purposes (e.g., gaps 
can be introduced in one or both of a first and a second amino acid or nucleic acid 
sequence for optimal alignment and non-homologous sequences can be disregarded for 
comparison purposes). In a preferred embodiment, the length of a reference sequence 
aligned for comparison purposes is at least 30%, preferably at least 40%, more 
preferably at least 50%, even more preferably at least 60%, and even more preferably at 
least 70%, 80%, or 90% of the length of the reference sequence. The amino acid 
residues or nucleotides at corresponding amino acid positions or nucleotide positions are 
then compared. When a position in the first sequence is occupied by the same amino 
acid residue or nucleotide as the corresponding position in the second sequence, then the 
molecules are identical at that position (as used herein amino acid or nucleic acid 
"identity" is equivalent to amino acid or nucleic acid "homology"). The percent 
identity between the two sequences is a function of the number of identical positions 
shared by the sequences, taking into account the number of gaps, and the length of each 
gap, which need to be introduced for optimal alignment of the two sequences. 
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The comparison of sequences and determination of percent identity between two 
sequences can be accomplished using a mathematical algorithm. In a preferred 
embodiment, the percent identity between two amino acid sequences is determined 
using the Needleman and Wunsch {J. Mol. Biol. (48):444-453 (1970)) algorithm which 
5 has been incorporated into the GAP program in the GCG software package (available at 
http://www.gcg.com), using either a Blossom 62 matrix or a PAM250 matrix, and a gap 
weight of 16, 14, 12, 10, 8, 6, or 4 and a length weight of 1, 2, 3, 4, 5, or 6. In yet 
another preferred embodiment, the percent identity between two nucleotide sequences is 
determined using the GAP program in the GCG software package (available at 
10 http://www.gcg.com), using a NWSgapdna.CMP matrix and a gap weight of 40, 50, 60, 
70, or 80 and a length weight of 1, 2, 3, 4, 5, or 6. In another embodiment, the percent 
identity between two amino acid or nucleotide sequences is determined using the 
5l algorithm of E. Meyers and W. Miller (CABIOS, 4:11-17 (1989)) which has been 

2: incorporated into the ALIGN program (version 2.0), using a PAM120 weight residue 

yi 15 table, a gap length penalty of 12 and a gap penalty of 4. 

The polynucleotide and amino acid sequences of the present invention can 
sj further be used as a "query sequence" to perform a search against public databases to, 

^ for example, identify other family members or related sequences, e.g., promoter 

sequences. Such searches can be performed using the NBLAST and XBLAST programs 
M 20 (version 2.0) of Altschul, et al (1990) J. Mol Biol 215:403-10. BLAST nucleotide 

S; £ searches can be performed with the NBLAST program, score = 100, wordlength = 12 to 

obtain nucleotide sequences homologous to polynucleotide molecules of the invention. 
BLAST protein searches can be performed with the XBLAST program, score = 50, 
wordlength = 3 to obtain amino acid sequences homologous to polypeptide molecules of 
25 the invention. To obtain gapped alignments for comparison purposes, Gapped BLAST 
can be utilized as described in Altschul et al, (1997) Nucleic Acids Res. 25(17):3389- 
3402. When utilizing BLAST and Gapped BLAST programs, the default parameters of 
the respective programs {e.g., XBLAST and NBLAST) can be used. See 
http://www.ncbi.nlm.nih.gov. 

30 

IL Synergism Between Endoglucanases 

The present invention is based, at least in part, on the discovery that 
endoglucanases act synergistically in degrading complex sugars. This invention is 
based, also in part, on the functional integration and expression of two endoglucanases 
35 {e.g., EGY and EGZ) by an ethanologenic host cell {e.g., K. oxytoca P2) to effect 

synergistic degradation of oligosaccharides {e.g., crystalline cellulose) and increase the 
production of ethanol by simultaneous saccharification and fermentation (SSF). 
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In one embodiment, an endoglucanase is derived from E. chrysanthemi and is 
the endoglucanase EGZ, which is encoded by the celZ gene (Boyer, et ah (1987) Eur. J. 
Biochem. 162:31 1-316). In another embodiment, an endoglucanase is derived from E. 
chrysanthemi and is the endoglucanase EGY, which is encoded by the celY gene 
(Guiseppi et ah, (1991) Gene 106:109-1 14). E. chrysanthemi EGY and EGZ are 
endoglucanases that have high activities in the degradation of carboxymethyl cellulose 
(CMC) and belong to, respectively, Type IV and Type II secretion groups (Hueck et ah 
(1998) Micro and Mo I Biol Rev 62:379-433). EGY and EGZ differ in substrate range 
and function synergistically during the hydrolysis of CMC and amorphous cellulose, 
indicating a potential need for both enzymes for optimal cellulase activity. Specifically, 
EGZ hydrolyzes cellotriose, cellotetraose, cellopentaose, amorphous cellulose, and 
CMC. EGY hydrolyses polymeric substrates to products of approximately 10 glucosyl 
residues. 

In another embodiment, the endoglucanases {e.g., EGY and EGZ) are purified 
separately and combined in a ratio sufficient for the synergistic degradation of an 
oligosaccharide substrate to occur and this may be determined using the assays disclosed 
herein. These assays allow for a determination and optimization of a ratio between, e.g., 
two given glucanases, e.g., endoglucanases. Typically the ratios range from about 9 to 1 
to about 1 9 to 1 . In one embodiment, the ratio can be about 9 to 1 or 1 9 to 1 for EGZ to 
EGY. In a preferred embodiment, optimum synergy is observed with a high ratio of 
EGZ to EGY, similar to that produced by E. chrysanthemi and by SZ21, a K. oxytoca 
recombinant which expresses celY and celZ (See example 4). 

In yet another embodiment, the endoglucanases can be combined concurrently 
with the oligosaccharide substrate. In yet another embodiment, the endoglucanases can 
be added to the oligosaccharide substrate sequentially. In a preferred embodiment, EGZ 
is sequentially added to the substrate following the addition of EGY, after heat 
inactivation of EGY activity. Example 3 describes, in detail, the synergistic effect of 
various ratios of endoglucanases {e.g., EGY and EGZ), the synergistic effect of 
sequential addition of endoglucanases {e.g., EGY and EGZ) (See Table 12), and the 
effect of various substrate concentrations (See Table 11) and incubation times on the 
synergistic activity of the endoglucanases. 

In yet another embodiment, the synergistic degradation of the oligosaccharide is 
of a factor of about 1.1, 1.2, 1.3, 1.4, 1.5, 1.6, 1.7, 1.8, 1.9, or 2.0 with a more preferable 
factor being about 1.8. The synergistic factor is calculated as the observed degradation 
divided by the sum of predicted contributions from EGY alone and EGZ alone (Riedel, 
et ah (1997) FEMS Microbiol. Lett. 147:239-243). In one embodiment, the 
endoglucasases are EGY and EGZ. The predicted contribution of EGY alone is about 
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10% of the total activity, and the predicted contributions of EGZ alone are about 90% of 
the total activity. 

In another aspect of the invention, at least one of the endoglucanases is derived 
from a cell extract. The cell may be recombinantly engineered to produce at least one 
endoglucanase. In a preferred embodiment, the cell is recombinantly engineered to 
produce two endoglucanases (e.g., EGY and EGZ). For example, the cell can be a 
bacterial cell. The recombinant cell comprises at least one heterologous polynucleotide 
segment, or, preferably, two or more heterolgous polynucleotides segments, encoding a 
polypeptide/s under the transcriptional control of one or more heterologous surrogate 
promoter/s. The heterologous polynucleotide and surrogate promoter may be plasmid 
based or integrated into the genome of the organism (as described in the examples). In a 
preferred embodiment, the host cell is used as a source of a desired polypeptide for use 
in the bioconversion of a complex sugar to ethanol, or a step thereof. In another 
preferred embodiment, the heterologous polynucleotide segment encodes one or more 
endoglucanases {e.g., EGY or EGZ) which are expressed at higher levels than are 
naturally occurring in the host. In one embodiment, the endoglucanases are purified 
separately from different recombinant cells and subsequently combined to 
synergistically degrade a substrate (e.g., an oligosaccharide). In another embodiment, 
one recombinant host cell can produce two or more endoglucanases concurrently and 
can act synergistically to degrade a substrate. 

In another aspect of the invention, the recombinant bacterial host cell is an 
ethanologenic bacterium such as, for example, K. oxytoca P2, an ethanologic derivative 
ofM5Al (Wood, etal (1992) Appl. Environ. Microbiol 58:2103-2110). In one 
embodiment, the recombinant ethanologenic bacterium contains at least one 
heterologous polynucleotide segment (e.g., celY or celZ derived from Erwinia) encoding 
at least one endoglucanase (e.g., EGY or EGZ). In a preferred embodiment, the 
recombinant ethanologenic bacteria contains more than one heterologous polynucleotide 
segments which encode endoglucanases. For example, as described in detail in Example 
4, eel Y and celZcan be functionally integrated, expressed, and secreted from the 
ethanologic strain K. oxytoca P2 concurrently to produce ethanol from an 
oligosaccharide substrate (e.g., crystalline cellulose). 



$~\/ In another embodiment, the recombinant host is a Gram-negative bacterium. In 
yet another embodimert, the recombinant host is from the family Enterobacteriaceae. 
The ethanologenic hostsWU.S.P.N. 5,821,093, hereby incorporated by reference, for 
example, are suitable hostsWd include, in particular, E. coli strains K04 (ATCC 
55123), KOI 1 (ATCC 55124V and K012 (ATCC 55125), and Klebsiella oxytoca strain 
P2 (ATCC 55307). Alternatively, a non-ethanologenic host of the present invention 
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may be converted into an ethanologenic host (such as the above-mentioned strains) by 
introducing, for example, ethanologenic genes from an efficient ethanol producer like 
Zymomonas mobilis. This type of genetic engineering, using standard techniques, 
results in a recombinant host capable of efficiently fermenting sugar into ethanol. In 
addition, the LY01 ethanol tolerant strain may be employed as described in published 
PCT international application WO 98/45425 and this published application is hereby 
incorporated by reference (see also, e.g., Yomano et al (1998) J. oflnd. Micro. & Bio. 
20:132-138). 

In another preferred embodiment, the invention makes use of a non- 
ethanologenic recombinant host, e.g., E. coli strain B, E. coli strain DH5cc, or Klebsiella 
oxytoca strain M5A1. These strains may be used to express at least one desired 
polypeptide, e.g., an endoglucanase, using techniques described herein. In addition, 
these recombinant hosts may be used in conjunction with another recombinant host that 
expresses yet another desirable polypeptide, e.g., a different endoglucanase. For 
example, a recombinant host producing EGZ can be combined with a recombinant host 
producing EGY to produce a synergistic effect. In addition, the non-ethanologenic host 
cell/s may be used in conjunction with an ethanologenic host cell. For example, the use 
of a non-ethanologenic host/s for carrying out, e.g., the synergistic depolymerization of 
a complex sugar may be followed by the use of an ethanologenic host for fermenting the 
depolymerized sugar. Accordingly, it will be appreciated that these reactions may be 
carried out serially or contemporaneously using, e.g., homogeneous or mixed cultures of 
non-ethanologenic and ethanologenic recombinant hosts. 

In a preferred embodiment, one or more genes for fermenting a sugar substrate 
into ethanol are provided on a plasmid or integrated into the host chromosome. More 
preferably, genes for fermenting a sugar substrate into ethanol, e.g., pyruvate 
decarboxylase (e.g.,pdc) and/or alcohol dehydrogenase (e.g., adh) are introduced into 
the host of the invention using an artificial operon such as the PET operon as described 
in U.S. P.N. 5,821,093, hereby incorporated by reference. Indeed, it will be appreciated 
that the present invention, in combination with what is known in the art, provides 
techniques and vectors for introducing multiple genes into a suitable host (see, e.g., 
Current Protocols in Molecular Biology, eds. Ausubel et al, John Wiley & Sons (1992), 
Sambrook, J. et al, Molecular Cloning: A Laboratory Manual 2nd, ed., Cold Spring 
Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY 
(1 989), and Bergey 's Manual of Determinative Bacteriology, Kreig et al , Williams and 
Wilkins (1984), hereby incorporated by reference). 

Accordingly, using the methods of the invention, a single genetic construct can 
encode all of the necessary gene products (e.g., a glucanase, an endoglucanase, an 
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exoglucanase, a secretory protein/s 5 pyruvate decarboxylase, alcohol dehydrogenase) for 
performing simultaneous saccharification and fermentation (SSF). For example, 
Example 4 describes, in detail, the simultaneous saccharification and fermentation (SSF) 
of crystalline cellulose (Sigmacell 50) by bacterial cellulases EGY and EGZ produced 
by ethanologenic K. oxytoca, with added commercial cellulase (Spezyme®). The 
endoglucanases produced by ethanologenic K. oxytoca and the commercial cellulase 
(Spezyme®) function synergistically to increase ethanol production (7% to 22%) from 
crystalline cellulose (Sigmacell 50). The beneficial effect is attributed almost 
exclusively to EGY, despite the fact that EGY activities were low in comparison to 
EGZ. Activity of the ethanologenic K. oxytoca strain SZ22, which expresses EGY, was 
nearly equivalent to the activity of the ethanologenic K. oxytoca strain SZ21, which 
expresses both EGY and EGZ activities. K. oxytoca strain SZ6, which expresses only 
EGZ showed little benefit from the production of over 20,000 U of endoglucanase 
activity per liter. 

In one embodiment, the composition of two endoglucanases which act 
synergistically to degrade an oligosaccharide also includes at least one additional 
enzymatic activity. This additional activity may be a glucanase activity selected from 
the group consisting of endoglucanase, exoglucanase, cellobiohydrolase, P-glucosidase, 
endo-l,4-p-xylanase, oc-xylosidase, a-glucuronidase, a-L-arabinofuranosidase, 
acetylesterase, acetylxylanesterase, oc-amylase, P-amylase, glucoamylase, pullulanase, 
P-glucanase, hemicellulase, arabinosidase, mannanase, pectin hydrolase, pectate lyase, 
or a combination thereof. In another embodiment, this additional enzymatic activity 
may be derived from a fungus, for example T. longibranchiatum. Fungi such as T. 
longibranchiatum produce multiple endoglucanase activities, which are presumed to 
function together with exoglucanases during the hydrolysis of crystalline cellulose 
(Nidetzky, et al (1995) Synergistic interaction of cellulases from Trichoderma reesei 
during cellulose degradation p.90-1 12. In J. N. Saddler and M. E. Himmel (ed.). 
Enzymatic degradation of insoluble carbohydrates ACS symposium series 618, 
American Chemical Society, Washington, D.C.; Tomme, et al (1995) Adv. Microbiol 
Physiol 37:1-81; Woodward, J. (1991) Bioresource Technol 36:67-75). 

In contrast to EGZ, EGY does not hydrolyze soluble cellobiosides but 
preferentially acts on longer chain substrates, producing ends, which can function as 
new sites for exoglucanase activity. In the absence of fungal cellulase additions, EGY 
and EGZ function synergistically to degrade amorphous cellulose. In nature, 
lignocellulosic substrates are depolymerized by mixtures of extracellular enzymes 
produced by consortia of fungi and bacteria. Thus, a mixture of E. chrysanthemi 
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enzymes and enzymes from the fungus T. reesei can improve the digestion of 
lignocellulosic substrates during bioconversion to ethanol. 

It will also be appreciated that a recombinant host may be further manipulated, 
using methods known in the art, to have mutations in any endogenous gene/s (e.g., 
recombinase genes) that would interfere with the stability, expression, function, and 
secretion of the introduced genes. Further, it will also be appreciated that the invention 
is intended to encompass any regulatory elements, gene/s, or gene products, i. e. , 
polypeptides, that are sufficiently homologous to the ones described herein. 

For effective degradation of oligosaccharides, the glucanase (e.g., EGY or EGZ) 
is preferably secreted into the extracellular millieu. Accordingly, in another 
embodiment of the invention, the host cell has been engineered to express a secretory 
protein/s to facilitate the export of the desired polypeptide from the cell. In one 
embodiment, the secretory protein or proteins are derived from a Gram-negative 
bacterial cell, e.g., a cell from the family Enterobacteriaceae. In another embodiment, 
the secretory protein/s are from Erwinia and are encoded by the out genes. In another 
embodiment, the secretory proteins are the pul genes derived from Klebsiella, The 
introduction of one or more of these secretory proteins is especially desirable if the host 
cell is an enteric bacterium, e.g., a Gram-negative bacterium having a cell wall. 
Representative Gram-negative host cells of the invention are from the family 
Enterobacteriaceae and include, e.g., Escherichia and Klebsiella. In one embodiment, 
the introduction of one or more secretory proteins into the host results in an increase in 
the secretion of the selected protein, e.g., a glucanase, as compared to naturally- 
occurring levels of secretion. Preferably, the increase in secretion is at least about 1 0% 
and more preferably, 100%, 200%, 300%, 400%, 500%, 600%, 700%, 800%, 900%, 
1 000%, or more, as compared to naturally-occurring levels of secretion. In a preferred 
embodiment, the addition of secretion genes allows for the glucanase polypeptide to be 
produced at higher levels. In a preferred embodiment, the addition of secretion genes 
allows for the glucanase polypeptide to be produced with higher enzymatic activity. In a 
most preferred embodiment, the glucanase is produced at higher levels and with higher 
enzymatic activity. Preferably, an increase in glucanase activity of at least about 10%, 
more preferably about 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, or 100% is 
observed. Most preferably, an increase in glucanase activity of several fold is obtained, 
e.g., about 200%, 300%, 400%, 500%, 600%, 700%, 800%, 900%, or 1000%, as 
compared to cells without secretion genes (e.g., cells that either lack or do not express 
secretion genes at a sufficient level). The techniques and methods for introducing such 
genes and measuring increased output of a desired polypeptide such as, e.g., a. 
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glucanase, are described in further detail in the examples. Other equivalent methods are 
known to those skilled in the art. 

EGY and EGZ are secreted by different mechanisms. As described in Example 
4, approximately 70% of the EGZ produced was secreted as an extracellular product 
when the E. chrysanthemi out genes were added on a plasmid (pCPP2006) consistent 
with a Type II secretion system (Hueck, CJ. (1998) Microbiol Mol Biol Rev. 62:379- 
433). Half of the EGY activity was secreted in the presence or absence of the out genes 
consistent with a Type IV secretion system (Hueck, C.J. supra). 

Methods for screening strains having the introduced genes (e.g., endoglucanase 
encoding genes such as celYand celZ and alcohol dehydrogenase genes) are routine and 
may be facilitated by visual screens that can identify cells expressing either the alcohol 
dehydrogenase (ADH) or glucanase (e.g., EGZ or EGY) gene product. The ADH gene 
product produces acetaldehyde that reacts with the leucosulfonic acid derivative of p- 
roseaniline to produce an intensely red product. Thus, ADH-positive clones can be 
easily screened and identified as bleeding red colonies. Methods for screening for a 
polysaccharase activity (e.g., an endoglucanase , such as EGZ or EGY), also results in a 
clear visual phenotype as described below and in the examples. 

Methods for screening for synergism between two endoglucanases are described 
in detail in the examples. EGY and EGZ can be purified separately from recombinant 
host cells transformed with a plasmid containing the celY gene or the celZ gene. The 
cell-free culture broth can be used to determine extracellular endoglucanase activity. 
Broth containing cells disrupted by ultrasound can be used to determine total activity. 
Hydrolysis of cellooligosaccharides can be analyzed by thin layer chromatography. 
Additionally, endoglucanase activity using CMC as a substrate can be determined in 
vitro by analyzing samples for reducing sugars. Reducing sugars can be measured using 
3,5-dinitrosalicyclic acid reagent with glucose as a standard. Synergy is calculated as 
the observed activity divided by the sum of predicted contributions from EGY alone 
(10%) plus EGZ alone (90%). 

Alternatively, recombinant host cells can be transformed with a plasmid 
containing both endoglucanases to produce both endoglucanases concurrently. Synergy 
can be determined as described above. Simultaneous saccharification and fermentation 
(SSF) can be carried out to observe ethanol production from a substrate (as described in 
Example 4). In SSF experiments with added commercial cellulase, two of the K. 
oxytoca recombinants containing at least one E. chrysanthemi endoglucanase produced 
more ethanol than the parent K. oxytoca strain which lacked the E. chrysanthemi 
endoglucanases. Both of these strains also produced ethanol levels equivalent to the 
best yeast SSF experiments (Cho 5 et al (1999) J. Microbiol Biotechnol 9:340-345) 
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using approximately one-third of the amount of added commercial cellulase (500 FPU/g 
cellulose versus 18 FPU/g cellulose for recombinant yeast). 

Recombinant bacteria expressing, for example, the PET operon typically grow to 
higher cell densities in liquid culture than the unmodified parent organisms due to the 
production of neutral rather than acidic fermentation products (Ingram et aL, (1988) 
Appl. Environ. Microbiol 54:397-404). On plates, ethanologenic clones are readily 
apparent as large, raised colonies which appear much like yeast. These traits have been 
very useful during the construction of new strains and can provide a preliminary 
indication of the utility of new constructs. Rapid evaluations of ethanol producing 
potential can also be made by testing the speed of red spot development on aldehyde 
indicator plates (Conway et aL , (1 987) J. Bacteriol 169:2591-2597). Typically, strains 
which prove to be efficient in sugar conversion to ethanol can be recognized by the 
production of red spots on aldehyde indicator plates within minutes of transfer. 

In a most preferred embodiment of the invention, a single host cell is 
ethanologenic, that is, has all the necessary genes, either naturally occurring or 
artificially introduced or enhanced (e.g., using a surrogate promoter and/or genes from a 
different species or strain), such that the host cell has the ability to produce and secrete 
two glucanses, preferably, an endoglucanase, more preferably at least two 
endoglucanases sufficient to , degrade a complex sugar, and ferment the degraded sugar 
into ethanol. Accordingly, such a host is suitable for simultaneous saccharification and 
fermentation. 

Moreover, the present invention takes into account that the native E. coli 
fermentation pathways produce a mixture of acidic and neutral products (in order of 
abundance): lactic acid, hydrogen + carbon dioxide (from formate), acetic acid, ethanol, 
and succinate. However, the Z mobilis PDC (pyruvate decarboxylase) has a lower Km 
for pyruvate than any of the competing E. coli enzymes. By expressing high activities 
of PDC, carbon flow is effectively redirected from lactic acid and acetyl-CoA into 
acetylaldehyde and ethanol. Small amounts of succinate can be eliminated by deleting 
the fumarate reductase gene (frd) (Ingram et aL, (1991) U.S.P.N, 5,000,000; Ohta et aL, 
(1991) AppL Environ. Microbiol. 57:893-900). Additional mutations (e.g., in the pfl or 
Idh genes) may be made to completely eliminate other competing pathways (Ingram et 
aL, (1991) U.S.P.N, 5,000,000). Additional mutations to remove enzymes (e.g., 
recombinases, such as recA) that may compromise the stability of the introduced genes 
(either plasmid-based or integrated into the genome) may also be introduced, selected 
for, or chosen from a particular background. 

In addition, it should be readily apparent to one skilled in the art that the ability 
conferred by the present invention, to transform genes coding for a protein or an entire 
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metabolic pathway into a single manipulable construct, is extremely useful. Envisioned 
in this regard, for example, is the application of the present invention to a variety of 
situations where genes from different genetic loci are placed on a chromosome. This 
may be a multi-cistronic cassette under the control of a single promoter or separate 
promoters may be used. 

Exemplary E. coli strains that are ethanologenic and suitable for further 
improvement according to the methods of the invention include, for example, K04, 
KOI 1, and KOI 2 strains, as well as the LY01 strain, an ethanol-tolerant mutant of the E. 
coli strain KOI 1 . Ideally, these strains may be derived from the E. coli strain ATCC 
1 1303, which is hardy to environmental stresses and can be engineered to be 
ethanologenic and secrete a polysaccharase/s. In addition, recent PCR investigations 
have confirmed that the ATCC 1 1303 strain lacks all genes known to be associated with 
the pathogenicity of E. coli (Kuhnert et al, (1997) Appl Environ. Microbiol 63:703- 
709). 

Another preferred ethanologenic host for improvement according to the methods 
of the invention is the E. coli KOI 1 strain which is capable of fermenting hemicellulose 
hydrolysates from many different lignocellulosic materials and other substrates (Asghari 
et al, (1996) J. Ind. Microbiol 16:42-47; Barbosa et al, (1992) Current Microbiol 
28:279-282; Beall etal, (1991) Biotechnol Bioeng. 38:296-303; Beall etal, (1992) 
Biotechnol Lett 14:857-862; Hahn-Hagerdal etal, (1994) Appl Microbiol Biotechnol 
41 :62-72; Moniruzzaman et al , (1 996) Biotechnol Lett 1 8:955-990; Moniruzzaman et 
al, (1998) Biotechnol Lett. 20:943-947; Grohmann et al , (1994) Biotechnol Lett 
16:281-286; Guimaraes et al, (1992) Biotechnol Bioeng. 40:41-45; Guimaraes et al, 
(1992) Biotechnol Lett. 14:415-420; Moniruzzaman et al, (1997) J. Bacteriol 
179:1880-1886). In Figure 1, the kinetics of bioconversion for this strain are shown. In 
particular, this strain is able to rapidly ferment a hemicellulose hydrolysate from rice 
hulls (which contained 58.5 g/L of pentose sugars and 37 g/L of hexose sugars) into 
ethanol (Moniruzzaman et al, (1998) Biotechnol Lett 20:943-947). It was noted that 
this strain was capable of fermenting a hemicellulose hydrolysate to completion within 
48 to 72 hours, and under ideal conditions, within 24 hours. 

Another preferred host cell of the invention is the bacterium Klebsiella. In 
particular, Klebsiella oxytoca is preferred because, like E. coli, this enteric bacterium 
has the native ability to metabolize monomeric sugars, which are the constituents of 
more complex sugars. Moreover, K. oxytoca has the added advantage of being able to 
transport and metabolize cellobiose and cellotriose, the soluble intermediates from the 
enzymatic hydrolysis of cellulose (Lai et al, (1996) Appl Environ. Microbiol 63:355- 
363; Moniruzzaman et al, (1997) Appl Environ. Microbiol 63:4633-4637; Wood etal, 
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(1992) Appl Environ. Microbiol 58:2103-2110). The invention provides genetically 
engineered ethanologenic derivatives of K. oxytoca, e.g., strain M5A1 having the Z 
mobilis pdc and adhB genes encoded within the PET operon (as described herein and in 
U.S.P.N. 5,821,093; Wood etal, (1992) Appl. Environ. Microbiol. 58:2103-2110). 

Accordingly, the resulting organism, strain P2, produces ethanol efficiently from 
monomer sugars and from a variety of saccharides including raffinose, stachyose, 
sucrose, cellobiose, cellotriose, xylobiose, xylotriose, maltose, etc. (Burchhardt et al, 
(1992) Appl. Environ. Microbiol. 58:1128-1133; Moniruzzaman etal, (1997) Appl. 
Environ. Microbiol. 63:4633-4637; Moniruzzaman et al. , (1997)./ Bacteriol. 179:1880- 
1886; Wood et al y (1992) Appl. Environ. Microbiol 58:2103-21 10). These strains may 
be further modified according to the methods of the invention to express and secrete one 
or more polysaccharases (e.g., endoglucanases). Accordingly, this strain is suitable for 
use in the bioconversion of a complex saccharide in an SSF process (Doran et al, (1993) 
Biotechnol Progress. 9:533-538; Doran etal, (1994) Biotechnol. Bioeng. 44:240-247; 
Wood et al , (1992) Appl Environ. Microbiol 58:2103-21 10). In particular, the use of 
this ethanologenic P2 strain eliminates the need to add supplemental cellobiase, and this 
is one of the least stable components of commercial fungal cellulases (Grohmann, 
(1994) Biotechnol. Lett, 16:281-286). 

Screen for Promoters Suitable for Use in Heterologous Gene Expression 

While in one embodiment, the surrogate promoter of the invention is used to 
improve the expression of a heterologous gene, e.g., a polysaccharase (an endoglucanase 
for example), it will be appreciated that the invention also allows for the screening of 
surrogate promoters suitable for enhancing the expression of any desirable gene product. 
In general, the screening method makes use of the cloning vector described in Example 
1 and depicted in Figure 3 that allows for candidate promoter fragments to be 
conveniently ligated and operably-linked to a reporter gene. In one embodiment, the 
celZ gene encoding glucanase serves as a convenient reporter gene because a strong 
colorimetric change results from the expression of this enzyme (glucanase) when cells 
bearing the plasmid are grown on a particular media (CMC plates). Accordingly, 
candidate promoters, e.g., a particular promoter sequence or, alternatively, random 
sequences that can be "shotgun" cloned and operably linked to the vector, can be 
introduced into a host cell and resultant colonies are scanned, visually, for having 
increased gene expression as evidenced by a phenotypic glucanase-mediated 
colorimetric change on a CMC plate. Colonies having the desired phenotype are then 
processed to yield the transforming DNA and the promoter is sequenced using 
appropriate primers (see Example 1 for more details). 
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The high correspondence between the glucanase-mediated colorimetric change 
on a CMC plate and expression levels of the enzyme is an excellent indication of the 
strength of a candidate promoter (Fig. 4). Hence, the methods of the invention provide a 
rapid visual test for rating the strength of candidate surrogate promoters. Accordingly, 
5 depending on the desired expression level needed for a specific gene product, a 

particular identified surrogate promoter can be selected using this assay. For example, if 
simply the highest expression level is desired, then the candidate promoter that produces 
the largest colorimetric change may be selected. If a lower level of expression is 
desired, for example, because the intended product to be expressed is toxic at high levels 

10 or must be expressed at equivalent levels with another product, a weaker surrogate 
promoter can be identified, selected, and used as described. 

The plasmid pLOI23 1 1 contains the celY coding region under the control of the 
lac promoter and pLOI23 1 6 was identified as a clone oriented to express celY from the 
lac promoter as determined by endoglucanase indicator plates. Replacement of the 

15 native promoter with the lac promoter increased celY expression by recombinant E.coli 
harboring this plasmid. In addition, to minimize problems associated with the 
expression of heterologous genes in industrial strains such as the ethanologenic K. 
oxytoca P2 strain, unregulated promoters were isolated from random fragments of Z 
mobilis DNA using functional assays. Using this method, a plasmid containing a Z 

20 mobilis Sau3A\ fragment as a heterolgous was constructed (pLOI2323) (see example 4 
and Figures 13 and 14). In addition, high levels of EGZ were produced by E.coli 
harboring the plasmid pLOI1620. See examples 3 and 4 for more details concerning the 
construction of the plasmids and selection of heterologous promoters. 

Example 4 describes the construction of the celY, celZ integration vector with 

25 surrogate promoters (pLOI2352) (See Figure 15). To ensure the stability of this 

plasmid, hybrid genes were integrated into the chromosome, and the antibiotic resistance 
markers used in construction were deleted using the FLP recombinase system (Martinez- 
Morales, et al (1999) J. Bacteriol 181:7143-7148) to facilitate further genetic 
modifications. 

30 

///. Methods of Use 

Degrading or Depolymerizing a Complex Saccharide 

In one embodiment, the host cell of the invention is used to degrade or 
depolymerize a complex sugar, e.g., lignocellulose or an oligosaccharide into a smaller 
35 sugar moiety. To accomplish this, the host cell of the invention preferably expresses 
one or more polysaccharases, e.g., endoglucanases, such as EGY and EGZ and these 
polysaccharases may be liberated naturally from the producer organism. Alternatively, 
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the polysaccharase is liberated from the producer cell by physically disrupting the cell. 
Various methods for mechanically (e.g., shearing, sonication), enzymatically (e.g., 
lysozyme), or chemically disrupting cells, are known in the art, and any of these 
methods may be employed. Once the desired polypeptide is liberated from the inner cell 
5 space it may be used to degrade a complex saccharide substrate into smaller sugar 

moieties for subsequent bioconversion into ethanol. The liberated polysaccharase may 
be purified using standard biochemical techniques known in the art. Alternatively, the 
liberated polysaccharase need not be purified or isolated from the other cellular 
components and can be applied directly to the sugar substrate. 

10 Accordingly, it will be appreciated by the skilled artisan that one or more 

polysaccharases can be selected for their activity and a composition may be formulated 
having an optimized activity, preferably synergistic activity, for degrading a complex 
sugar. The composition may take e.g., the form of an unpurified, semi-purified, or 
purified endoglucanase activity which is mixed with one or more endoglucanase 

1 5 activities in a ratio that provides optimal degrading of a complex sugar. Alternatively, 
each enzyme activity may be separately formulated with instructions for use, i.e., mixing 
or applying in a preferred order and/or ratio, in order to achieve optimal degrading of a 
complex sugar. 

In another embodiment, a host cell is employed that coexpresses one or more 

20 polysaccharases and a secretory protein/s such that the polysaccharases are secreted into 
the growth medium. This eliminates the above-mentioned step of having to liberate the 
polysaccharases from the host cell. When employing this type of host, the host may be 
used directly in an aqueous solution containing an oligosaccharide. 

In another embodiment, a host cell of the invention is designed to express more 

25 than one polysaccharase or is mixed with another host expressing a different 

polysaccharase in a ratio sufficient for the synergistic degradation of an oligosaccharide 
to occur. For example, one host cell could express a heterologous endoglucanase (e.g., 
EGY) while another host cell could express another endoglucanase (e.g., EGZ), and 
these cells could be combined to form a heterogeneous culture having synergistic 

30 activity in the degradation of oligosaccharides. Alternatively, in a preferred 
embodiment, a single host strain is engineered to produce all of the above 
polysaccharases. In either case, a culture of recombinant host/s is produced having high 
expression of the desired polysaccharases for application to an oligosaccharide. If 
desired, this mixture can be combined with an additional cellulase, e.g., an exogenous 

35 cellulase, such as a fungal cellulase. This mixture is then used to degrade a complex 
substrate. Alternatively, prior to the addition of the complex sugar substrate, the 
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polysaccharase/s are purified from the cells and/or media using standard biochemical 
techniques and used as a pure enzyme source for depolymerizing a sugar substrate. 

It will be appreciated by the skilled artisan, that the ethanol-producing bacterial 
strains of the invention are superior hosts for the production of recombinant proteins 
5 because, under anaerobic conditions (e.g. , in the absence of oxygen), there is less 

opportunity for improper folding of the protein (e.g., due to inappropriate disulfide bond 
formation). Thus, the hosts and culture conditions of the invention potentially result in 
the greater recovery of a biologically active product. 

10 Fermenting a Complex Saccharide 

In a preferred embodiment of the present invention, the host cell having the 
above mentioned attributes is also ethanologenic. Accordingly, such a host cell can be 
applied in synergistically degrading or depolymerizing a complex saccharide into a 
monosaccharide. Subsequently, the cell can catabolize the simpler sugar into ethanol by 

1 5 fermentation. This process of concurrent complex saccharide depolymerization into 
smaller sugar residues followed by fermentation is referred to as simultaneous 
saccharification and fermentation (SSF). 

Typically, fermentation conditions are selected that provide an optimal pH and 
temperature for promoting the best growth kinetics of the producer host cell strain and 

20 catalytic conditions for the enzymes produced by the culture (Doran et al, (1993) 
Biotechnol Progress. 9:533-538). For example, for Klebsiella, e.g., the P2 strain, 
optimal conditions were determined to be between 35-37° C and pH 5.0- pH 5.4. Under 
these conditions, even exogenously added fungal endoglucanases and exoglucanases are 
quite stable and continue to function for long periods of time. Other conditions are 

25 discussed in the Examples. Moreover, it will be appreciated by the skilled artisan, that 
only routine experimentation is needed, using techniques known in the art, for 
optimizing a given fermentation reaction of the invention. 

Currently, the conversion of a complex saccharide such as lignocellulose, is a 
very involved, multi-step process. For example, the lignocellulose must first be 

30 degraded or depolymerized using acid hydrolysis. This is then followed by steps that 
separate liquids from solids and these products are subsequently washed and detoxified 
to result in cellulose that can be further depolymerized (using added cellulases) and 
finally, fermented by a suitable ethanologenic host cell. In contrast, the fermenting of 
corn is much simpler in that amylases can be used to break down the corn starch for 

35 immediate bioconversion by an ethanologenic host in essentially a one-step process. 

Accordingly, it will be appreciated by the skilled artisan that the recombinant 
hosts and methods of the invention afford the use of a similarly simpler and more 
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efficient process for fermenting lignocellulose. For example, the method of the 
invention is intended to encompass a method that avoids acid hydrolysis altogether. 
Moreover, the hosts of the invention have the following advantages, 1) efficiency of 
pentose and hexose co-fermentation; 2) resistance to toxins; 3) production of enzymes 
for complex saccharide depolymerization; and 4) environmental hardiness. Therefore, 
the complexity of depolymerizing lignocellulose can be simplified using an improved 
biocatalyst of the invention. Indeed, in one preferred embodiment of the invention, the 
reaction can be conducted in a single reaction vessel and in the absence of acid 
hydrolysis, e.g., as an SSF process. 

Potential Substrates for Bioconversion into Ethanol 

One advantage of the invention is the ability to use a saccharide source that has 
been, heretofore, underutilized. Consequently, a number of complex saccharide 
substrates may be used as a starting source for depolymerization and subsequent 
fermentation using the host cells and methods of the invention. Ideally, a recyclable 
resource may be used in the SSF process. Mixed waste office paper is a preferred 
substrate (Brooks et al. 9 (1995) Biotechnol Progress. 11:619-625; Ingram etal, (1995) 
U.S.P.N. 5,424,202), and is much more readily digested than acid pretreated bagasse 
(Doran et al 9 (1994) Biotech. Bioeng. 44:240-247) or highly purified crystalline 
cellulose (Doran et al (1993) Biotechnol. Progress. 9:533-538). Glucanases, both 
endoglucanases and exoglucanases, contain a cellulose binding domain, and these 
enzymes can be readily recycled for subsequent fermentations by harvesting the 
undigested cellulose residue using centrifugation (Brooks et al, (1995) Biotechnol. 
Progress. 1 1 :6 19-625). By adding this residue with bound enzyme as a starter, ethanol 
yields (per unit substrate) were increased to over 80% of the theoretical yield with a 
concurrent 60% reduction in fungal enzyme usage (Figure 2). Such approaches work 
well with purified cellulose, although the number of recycling steps may be limited with 
substrates with a higher lignin content. Other substrate sources that are within the scope 
of the invention include any type of processed or unprocessed plant material, e.g., lawn 
clippings, husks, cobs, stems, leaves, fibers, pulp, hemp, sawdust, newspapers, etc. 

This invention is further illustrated by the following examples, which should not 
be construed as limiting. 
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EXAMPLE 1 

Methods for Making Recombinant Escherichia Hosts Suitable for Fermenting 

Oligosaccharides into Ethanol 

In this example, methods for developing and using Escherichia hosts suitable for 
fermenting oligosaccharides into ethanol are described. In particular, a strong promoter 
is identified which can be used to increase the expression of a polysaccharase (e.g., 
glucanase). In addition, genes from Erwinia chrysanthemi are employed to facilitate 
polysaccharase secretion thereby eliminating the need for cell disruption in order to 
release the desired polysaccharase activity. 

Throughout this example, the following materials and methods are used unless 
otherwise stated. 

Materials and Methods 

Organisms and Culture Conditions 

The bacterial strains and plasmids used in this example are listed in Table 1 , 

below. 

For plasmid constructions, the host cell E. coli DH5a was used. The particular 
gene employed encoding a polysaccharase (e.g., glucanase) was the celZ gene derived 
from Erwinia chrysanthemi P86021 (Beall, (1995) Ph.D. Dissertation, University of 
Florida; Wood et al, (1997) Biotech. Bioeng. 55:547-555). The particular genes used 
for improving secretion were the out genes derived from E. chrysanthemi EC 16 (He et 
al., (1991) Proc. Natl Acad. Set USA. 88:1079-1083). 

Typically, host cell cultures were grown in Luria-Bertani broth (LB) (10 g L" 1 
Difco® tryptone, 5 g L" 1 Difco® yeast extract, 5 g L" 1 sodium chloride) or on Luria agar 
(LB supplemented with 15 g L" 1 of agar). For screening host cells having glucanase 
celZ activity (EGZ), CMC-plates (Luria agar plates containing carboxymethyl cellulose 
(3. g L" 1 )) were used (Wood et al, (1988) Methods in Enzymology 160:87-112). When 
appropriate, the antibiotics ampicillin (50 mg L" 1 ), spectinomycin (100 g L" 1 ), 
kanamycin (50 g L" 1 ) were added to the media for selection of recombinant or integrant 
host cells containing resistance markers. Constructs containing plasmids with a 
temperature conditional pSClOl replicon (Posfai et al, (1997) J. Bacteriol 
179:4426-4428) were grown at 30°C and, unless stated otherwise, constructs with 
pUC-based plasmids were grown at 37°C. 
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TABLE 1. Strains and Plasmids Used 



Strains/Plasm ids 


Description 


Sources/References 


Strains 


Z mobilis CP4 


Prototrophic 


Osman, et aL, (1985) J. 
Bact. 164:173-180 


E. coli strain DH5oc 


lacZ Ml 5 recA 


Bethesda Research 
Laboratory 


E. coli strain B 


Prototrophic 


ATCC 11303 


E. coli strain HB 101 


RecA lacY recA 


ATCC 37159 


Plasmids 


pUC19 


bla cloning vector 


New England Biolabs 


pST76-K 


kan low copy number, temp, sensitive 


Posfai, et al, (1997) J. 
BacterioL 179:4426-4428 


pRK2013 


kan mobilizing helper plasmid (mob+) 


ATCC 


pCPP2006 


Sp r , ca. 40 kbp plasmid carrying the complete out 
genes from E. chrysanthemi EC 1 6 


He, etal,. (1991) P.N.A.S. 
88:1079-1083 


pLOI1620 


bla celZ 


Beall, etal (1995) Ph.D. 
Dissertation, U. of Florida 


pLOI2164 


pLOI1620 with BamWl site removed (Klenow) 


See text 


pLOI2 1 70 


Ndel-Hindlll fragment (promoterless celZ) from 
pLOI2164 cloned into pUC19 


See text 

— — — — — - 


pLOI2171 


BamHl-Sphl fragment (promoterless celZ) from 
pLOI2 1 70 cloned into pST76-K 


See text 


pLOI2173 


EcoKl-Sphl fragment (celZ with native promoter) 
from pLOI2164 cloned into pST76-K 


See text 


pLOI2174 


EcoKl-BamHl fragment (gap promoter) cloned into 
pLOI2171 


See text 


pLOI2175 


EcoKl-BamHl fragment (eno promoter) cloned into 
pLOI2171 


See text 


pLOI2177 


Random SauSAl Z. mobilis DNA fragment cloned 
into pLOI2171 


See text 


pLOI2 1 78 


Random Sau3Al Z. mobilis DNA fragment cloned 
into pLOI2171 


^ee text 


pLOI2179 


Random Sau3Al Z mobilis DNA fragment cloned 
into pLOI2171 


See text 


pLOI2180 


Random Sau3A\ Z mobilis DNA fragment cloned 
into pLOI2171 


See text 


pLOI2181 


Random Saw3Al Z mobilis DNA fragment cloned 
into pLOI2171 


See text 


pLOI2182 


Random Saw3Al Z mobilis DNA fragment cloned 
into pLOI2171 


See text 


pLOI2183 


Random Sau3A\ Z mobilis DNA fragment cloned 
into pLOI2171 


See text 


pLOI2184 


Random Sau3A\ Z mobilis DNA fragment cloned 
into pLOI2171 


See text 


pLOI2196 


pLOI2177 fused into pUC19 at the Pstl site 


See text 


pLOI2197 


pLOI2180 fused into pUC19 at the Pstl site 


See text 


pLOI2 1 98 


pLOI2182 fused into pUC19 at the Pstl site 


See text 


pLOI2199 


pLOI2183 fused into pUC19 at the Pstl site 


See text 


pLOI2307 


EcoKl-Sphl fragment from pLOI2183 cloned into 
pUC19 


See text 
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Genetic Methods 

Standard techniques were used for all plasmid constructions (Ausubel et al 9 
(1987) Current Protocols in Molecular Biology, John Wiley & Sons, Inc.; Sambrook et 
al, (1989) Molecular cloning: a laboratory manual, 2 nd ed. C.S.H.L., Cold Spring 
5 Harbor, N. Y). For conducting small-scale plasmid isolation, the TELT procedure was 
performed. For large-scale plasmid isolation, the Promega® Wizard Kit was used. For 
isolating DNA fragments from gels, the Qiaquick® Gel Extraction Kit from Qiagen® 
was employed. To isolate chromosomal DNA from E. coli and Z mobilis the methods 
of Cutting and Yomano were used (Cutting et al 9 (1990), Genetic analysis, pp. 61-74, 

10 In, Molecular biological methods for Bacillus, John Wiley & Sons, Inc.; Yomano et al, 
(1993) J. Bacteriol 175:3926-3933). 

To isolate the two glycolytic gene promoters {e.g., gap and end) described 
herein, purified chromosomal DNA from E. coli DH5oc was used as a template for the 
PCR (polymerase chain reaction) amplification of these nucleic acids using the 

1 5 following primer pairs: gap promoter, 5'- 

CG AATTCCTGCCG A AGTTT ATTAGCC A-3 ' (SEQ ID NO: 3) and 5'- 
AAGGATCCTTCCACCAGCTATTTGTTAGTGA-3' (SEQ ID NO: 4); eno promoter, 
5 ' - AGAATTCTGCC AGTTGGTTGACGAT AG-3 ' (SEQ ID NO: 5) and 5'- 
CAGGATCCCCTCAAGTCACTAGTTAAACTG-3' (SEQ ID NO: 6). The out genes 

20 encoding secretory proteins derived from E. chrysanthemi (pCPP2006) were conjugated 
into E. coli using pRK2013 for mobilization (Figurski et al, (1979) Proc. Natl Acad 
Sci. USA. 76: 1648-1652; Murata et al, (1990) J. Bacteriol 172:2970-2978). 

To determine the sequence of various DNAs of interest, the dideoxy sequencing 
method using fluorescent primers was performed on a LI-COR Model 4000-L DNA 

25 Sequencer. The pST76-K-based plasmids were sequenced in one direction using a T7 
primer (5'-TAATACGACTCACTATAGGG-3 ' (SEQ ID NO: 7)). ThepUC18-and 
pUCI9-based plasmids were sequenced in two directions using either a forward primer 
(5'-CACGACGTTGTAAAACGAC-3' (SEQ ID NO: 8)) or a reverse primer (5'- 
TAAC AATTTC ACACAGGA-3 ' (SEQ ID NO: 9)). The extension reactions of the 

30 sequencing method were performed using a Perkin Elmer GeneAmp® PCR 9600 and 
SequiTherm Long-Read Sequencing Kit-LC®. Resultant sequences were subsequently 
analyzed using the Wisconsin Genetic Computer Group (GCG) software package 
(Devereux et al, (1984) Nucleic Acids Rev, 12:387-395). 

To determine the start of transcriptional initiation in the above-mentioned 

35 promoters, primer extension analysis was performed using standard techniques. In 
particular, promoter regions were identified by mapping the transcriptional start sites 
using a primer finding correspondence within the celZ gene RNA that was isolated from 
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cells in late exponential phase using a Qiagen RNeasy® kit. Briefly, cells were treated 
with lysozyme (400 |ig/ml) in TE (Tris-HCl, EDTA) containing 0.2 M sucrose and 
incubated at 25° C for 5 min prior to lysis. Liberated RNA was subjected to ethanol 
precipitation and subsequently dissolved in 20 |il of Promega® AMV reverse 
5 transcriptase buffer (50 mM Tris-HCl, pH 8.3, 50 mM KC1, 10 mM MgCl 2 , 0.5 mM 
spermadine, lOmMDTT). An IRD41 -labeled primer (5*- 

GACTGGATGGTTATCCGAATAAGAGAGAGG-3' (SEQ ID NO: 10)) from Ll-Cor 
Inc. was then added and the sample was denatured at 80° C for 5 min, annealed at 55° C 
for 1 hr, and purified by alcohol precipitation. Annealed samples were dissolved in 19 
10 jil of AMV reverse transcriptase buffer containing 500 |uM dNTPs and 10 units AMV 
reverse transcriptase, and incubated for extension (1 h at 42°C). Products were treated 
with 0.5 |ig/ml DNase-free RNase A, precipitated, dissolved in loading buffer, and 
compared to parallel dideoxy promoter sequences obtained using the LI-COR Model 
4000-L DNA sequencer. 

15 

Polysaccharase Activity 

To determine the amount of polysaccharase activity (e.g., glucanase activity) 
resulting from expression of the celZ gene, a Congo Red procedure was used (Wood et 
aL, (1988) Methods in Enzymology 160:87-1 12). In particular, selected clones were 

20 transferred to gridded CMC plates and incubated for 18 h at 30° C and then stained and 
recombinant host cells expressing glucanase formed yellow zones on a red background. 
Accordingly, the diameters of these colorimetric zones were recorded as a relative 
measure of celZ expression. 

Glucanase activity (EGZ) was also measured using carboxymethyl cellulose as a 

25 substrate. In this test, appropriate dilutions of cell-free culture broth (extracellular 

activity) or broth containing cells treated with ultrasound (total activity) were assayed at 
35° C in 50 mM citrate buffer (pH 5.2) containing carboxymethyl cellulose (20 g L' 1 ). 
Conditions for optimal enzyme release for 3-4 ml samples were determined to be 4 
pulses at full power for 1 second each using a cell disruptor (Model W-220F, Heat 

30 System-Ultrasonics Inc., Plainview, NY). To stop the enzyme reactions of the assay, 
samples were heated in a boiling water bath for 10 min. To measure reducing sugars 
liberated enzymatically by the glucanase, a dinitrosalicylic acid reagent was employed 
using glucose as a standard (Wood et al 7 (1988) Methods in Enzymology 160:87-1 12). 
The amount of enzyme activity (IU) was expressed as jamols of reducing sugar released 

35 per min or as a percentage of total activity from an average of two or more 
determinations. 
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Ultrastructural Analysis 

To determine the infrastructure of various recombinant host cells, fresh colonies 
from Luria agar plates were prepared for analysis by fixing in 2% glutaraldehyde in 0.2 
M sodium cacodylate buffer (pH 7) followed by incubation in 1% osmium tetroxide and 
5 followed by 1% uranyl acetate in distilled water. Samples were dehydrated in ethanol, 
embedded in Spurr's plastic, and ultrathin sections were prepared and examined using a 
Zeiss® EM-IOCA electron microscope (Spur (1969) J. Ultrastruct Res. 26:31). 

Construction of a Low Copy Promoter Probe Vector Using celZ as the Reporter Gene 

10 To facilitate the isolation of strong promoters, a low copy vector was constructed 

with a pSClOl replicon and a BamHl site immediately preceding a promoterless celZ 
™, gene (pLO 12171). Accordingly, this promoterless plasmid was used as a negative 

control. The plasmid pLOI1620 was used as a source of celZ and is a pUC18 derivative 
y» with expression from consecutive lac and celZ promoters. The BamHl site in this 

fn 1 5 plasmid was eliminated by digestion and Klenow treatment (pLOI2 1 64). The celZ gene 

fil was isolated as a promoterless Ndel fragment after Klenow treatment. The resulting 

r\ blunt fragment was digested with Hindlll to remove downstream DNA and ligated into 

pUC19 (Hindlll to Hindi) to produce pLOI2170. In this plasmid, celZ is oriented 
J;? opposite to the direction of lacZ transcription and was only weakly expressed. The 

20 BamHl (amino terminus)-S/?M (carboxyl terminus) fragment from pLOI2 1 70 containing 
"Jf celZ was then cloned into the corresponding sites of pST76-K, a low copy vector with a 

temperature sensitive replicon, to produce pLOI2171 (Fig. 3). Expression of celZ in this 
vector was extremely low facilitating its use as a probe for candidate strong promoters. 

25 Analysis of celZ Expression from Two E. coli Glycolytic Promoters (gap and eno) 

Two exemplary promoters driving glycolytic genes (gap and eno) in E, coli were 
examined for their ability to drive the expression of the heterologous celZ gene encoding 
glucanase. Chromosomal DNA from the E. coli DH5cc strain was used as a template to 
amplify the gap and eno promoter regions by the polymerase chain reaction. The 

30 resulting fragments of approximately 400 bp each were digested with EcoTU and BamHl 
and cloned into the corresponding sites in front of a promoterless celZ gene in 
pLOI2171 to produce pLOI2174 (gap promoter) and pLOI2175 (eno promoter). As a 
control, the EcoRl-Sphl fragment from pLOI2164 containing the complete celZ gene 
and native E. chrysanthemi promoter was cloned into the corresponding sites of 

35 pST76-K to produce pLOI2173. These three plasmids were transformed into E. coli 

strains B and DH5a and glucanase activity (EGZ) was compared. For both strains of E. 
coli, glucanase activities were lower on CMC plates with E. coli glycolytic promoters 
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than with pLOI2173 containing the native E. chrysanthemi promoter (Table 2). 
Assuming activity is related to the square of the radius of each zone (Fick's Law of 
diffusion), EGZ production with glycolytic promoters (pLOI2174 and pLOI2175) was 
estimated to be 33% to 65% lower than in the native promoter construct (pLOI2373). 
Accordingly, other candidate promoters for driving high levels of celZ gene expression 
were investigated. 

Identifying and Cloning Random DNA Fragments Suitable for Use as Promoters for 
Heterologous Gene Expression 

Random fragments derived from Z mobilis can be an effective source of 
surrogate promoters for the high level expression of heterologous genes in E. coli. 
(Conway et aL, (1987) J. Bacteriol 169:2327-2335; Ingram et al, (1988) AppL Environ. 
Micro. 54:397-404). Accordingly, to identify surrogate promoters for Erwinia celZ 
expression, Z mobilis chromosomal DNA was extensively digested with Sau3Al and 
resulting fragments were ligated into pLOI2171 at the BamHl site and transformed into 
E. coli DH5a to generate a library of potential candidate promoters. To rapidly identify 
superior candidate promoters capable of driving celZ gene expression in E. coli, the 
following biological screen was employed. Colonies transformed with celZ plasmids 
having different random candidate promoters were transferred to gridded CMC plates 
and stained for glucanase activity after incubation (Table 2). Approximately 20% of the 
18,000 clones tested were CMC positive. The 75 clones which produced larger zones 
than the control, pLOI2173, were examined further using another strain, E. coli B. 
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TABLE 2. Evaluation of promoter strength for ce/Z expression in E. coli using CMC 
indicator plates. 





E. coli DH5oc host 


E. coli B host 


— — — 

Plasmids 


Number of 


CMC zone 


% of native 


Number 


CMC zone 


% of native 




Plasmids a 


diameter (mm) b 


promoter 


of 


diameter (mm) 


promoter 








(100*R yR c ) 


plasmids 




(100*R j/R c ) 


pLUlzl / 1 


1 


0 


-- 


— 


— 


— 


(promoterless) 














pLOI2173 


1 


5.0 


100 


1 


4.5 


100 


^naiive 














promoter) 














pLOI2174 


1 


4.0 


77 


1 


3.5 


60 
















promoter) 














pLOI2175 


1 


3.0 


43 


1 


2.8 


35 


(eno 














promoter) 














Z. mobilis 














promoters 














Group I 


5 


13.0 


676 


4 


10.8-11.3 


570-625 


Group II 


14 


9.0-11.0 


324-484 


17 


9.0-10.5 


445-545 


Group III 


56 


6.0-9.0 


144-324 


54 


5.0-8.8 


125-375 



a The number of clones which the indicated range of activities. 

b The average size of the diameters from three CMC digestion zones. 

R x is the square of the radius of the clear zone with the test plasm id; R c is the square of the radius of 
the clear zone for the control (pLOI2173). 



Thus, promoter strength for selected candidate promoters was confirmed in two 
different strains with, in general, recombinants of DH5a producing larger zones (e.g., 
more glucanase) than recombinants of strain B. However, relative promoter strength in 
each host was similar for most clones. Based on these analyses of glucanase production 
as measured by zone size using CMC plates, four clones appeared to express celZ at 
approximately 6-fold higher levels than the construct with the original E. chrysanthemi 
celZ gene (pLOI2173), and at 10-fold higher levels than either of the E. coli glycolytic 
promoters. Accordingly, these and similarly strong candidate promoters were selected 
for further study. 

Production and Secretion of Glucanase 

Eight plasmid derivatives of pST76-K (pLOI2177 to pLOI2184) were selected 
from the above-described screen (see Group I and Group II (Table 2)) and assayed for 
total glucanase activity in E. coli strain B (Table 3). The four plasmids giving rise to the 
largest zones on CMC plates were also confirmed to have the highest glucanase 
activities (pLOI2177, pLOI2180, pLOI2182, and pLOI2183). The activities were 
approximately 6-fold higher than that of the unmodified celZ (pLOI2173), in excellent 
agreement with our estimate using the square of the radius of the cleared zone on CMC 
plates. Figure 4 shows a comparison of activity estimates from CMC plates and in vitro 
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enzyme assays for strain B containing a variety of different promoters, with and without 
the addition of out genes encoding secretory proteins. Although there is some scatter, a 
direct relationship is clearly evident which validates the plate method for estimating 
relative activity. The original construct in pUC18, a high copy plasmid, was also 
included for comparison (pLOI2164). This construct with consecutive lac and celZ 
promoters produced less EGZ activity than three of the low copy plasmids with 
surrogate promoters (pLOI2177, pLOI2182, and pLOI2183). Thus, to increase celZ 
expression of glucanase even more, the DNA fragment containing celZ and the most 
effective surrogate promoter was isolated from pLOI2183 (as a EcoKL-Sphl fragment) 
and inserted into pUC19 with transcription oriented opposite to that of the lac promoter 
(pLOI2307). Accordingly, the above-identified strong surrogate promoter when 
incorporated into a high copy plasmid, further increased glucanase activity by 2-fold. 

Engineering Increased Secretion of Glucanase 

To further improve on the above-described results for increasing expression of 
celZ encoded glucanase, the above host cells were engineered for increased secretion. 
Genes encoding secretory proteins {e.g., the out genes) derived from E. chrysanthemi 
EC 16 were used for improving the export of the glucanase using the plasmid as 
described in He et al that contains out genes (pCPP2006) (He et al 9 (1991) Proa Natl 
Acad. Set USA. 88:1079-1083). The increased secretion of EGZ in E. coli B was 
investigated and results are presented in Table 3. 



TABLE 3. Comparison of promoters for EGZ production and secretion in E. coli B 



Plasmids 3 


Without secretion genes 


With secretion genes (pCPP2006) 


Total activity (IU/L)° 


Extracellular c (%) 


Total Activity (IU/L) 


Extracellular 
(%) 


pLOI2173 


620 


17 


1,100 


43 


pLOI2177 


3,700 


10 


5,500 


44 


pLOI2178 


2,200 


9 


3,500 


49 


pLOI2179 


2,000 


10 


3,000 


50 


pLOI2180 


2,900 


8 


6,300 


39 


pLOI2181 


1,800 


11 


4,100 


46 


pLOI2 1 82 


3,500 


7 


6,600 


38 


pLOI2 1 83 


3,400 


7 


6,900 


39 


pLOI2184 


2,100 


12 


2,400 


39 


pLOI2164 


3,200 


20 \ 


6,900 


74 


pLOI2307 


6,600 


28 


13,000 


60 



a Plasmids pLOI2173 and pLO!2164 contain the celZ native promoter; pLOI2307 contains the strong 
promoter from pLOI2183. 

Plasmids pLOI2164 and pLOI2307 are pUC-based plasmids (high copy number). All other plasmids are 
derivatives of pST76-K (low copy number). 

b Glucanase activities were determined after 16 h of growth at 30°C. 
c Extracellular activity (secreted or released). 
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Recombinant hosts with low copy plasmids produced only 7- 17% of the total 
EGZ extracellularly (after 16 hours of growth) without the additional heterologous 
secretory proteins (out proteins encoded by plasmid pCPP2006). A larger fraction of 
5 EGZ (20-28%) was found in the extracellular broth surrounding host cells with the 
high-copy pUC-based plasmids than with the low copy pST76-based plasmids 
containing the same promoters. However, in either case ? the addition of out genes 
encoding secretory proteins (e.g., pCPP2006) increased the total level of expression by 
up to 2-fold and increased the fraction of extracellular enzyme (38-74%) by 
10 approximately 4-fold. The highest activity, 13,000 IU/L of total glucanase of which 
7,800 IU/L was found in the cell-free supernatant was produced by strain B having both 
pLOI2307 encoding celZ driven by a strong surrogate promoter and pCPP2006 encoding 
out secretory proteins). 

31 It has been reported that under certain conditions (pH 7, 37° C), the specific 

IT? I 

jrJJ 1 5 activity for pure EGZ enzyme is 4 1 9 IU (Py et al. , ( 1 99 1 ) Protein Engineering 4:325- 
fU 333) and it has been determined that EGZ produced under these conditions is 25% more 

active than under the above-mentioned conditions (pH 5.2 citrate buffer, 35° C). 
g Accordingly, assuming a specific activity of 3 16 IU for pure enzyme at pH 5.2 (35°C), 

CI the cultures of E. coli B (containing pLOI2307 and pCPP2006, e.g., plasmids encoding 

y a 20 glucanase and secretory proteins), produced approximately 41 mg of active EGZ per 
y*J liter or 4-6% of the total host cell protein was active glucanase. 

Sequence Analysis of the Strongest Promoter Derived from Z. mobilis 

The sequences of the four strongest surrogate promoters (pLOI2177, pLOI2180, 
25 pLOI2182, and pLOI2183) were determined. To facilitate this process, each was fused 
with pUC 1 9 at the Pstl site. The resulting plasmids, pLOI2 1 96, pLOI2 1 97, pLOI2 1 98, 
and pLOI2199, were produced at high copy numbers (ColEI replicon) and could be 
sequenced in both directions using Ml 3 and T7 sequencing primers. All four plasmids 
contained identical pieces of Z mobilis DNA and were siblings. Each was 141 7 bp in 
30 length and contained 4 internal Sau3AI sites. DNA and translated protein sequences 
(six reading frames) of each piece were compared to the current data base. Only one 
fragment (281 bp internal fragment) exhibited a strong match in a BLAST search 
(National Center for Biotechnology Information; http://www.ncbi.nlm.nih.gov/BLAST/ ) 
and this fragment was 99% identical in DNA sequence to part of the Z mobilis hpnB 
35 gene which is proposed to function in cell envelope biosynthesis (Reipen et al , (1995) 
Microbiology 141 :155-161). Primer extension analysis revealed a single major start site, 
67 bp upstream from the Sau3 Al/BamHl junction site with celZ, and a second minor 
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start site further upstream (Fig. 5). Sequences in the -10 and -35 regions were compared 
to the conserved sequences for E. coli sigma factors (Wang et ai, (1989) J. Bacteriol. 
180:5626-5631; Wise et al, (1996) J. Bacteriol 178:2785-2793). The dominant 
promoter region (approximately 85% of total start site) appears similar to a sigma 70 

-10 

promoter while the secondary promoter site resembles a sigma promoter. 

Microscopic Analysis of Recombinant Host Cells Producing Glucanase 

Little difference in cell morphology was observed between recombinants and the 
parental organism by light microscopy. Under the electron microscope, however, small 
polar inclusion bodies were clearly evident in the periplasm of strain B (pLOI2164) 
expressing high amounts of glucanase and these inclusion bodies were presumed to 
contain EGZ (Fig. 6). In the strain B (pLOI2307) that produced 2-fold higher glucanase 
activity the inclusion bodies were even larger and occupied up to 20% of the total cell 
volume. The large size of these polar bodies suggests that glucanase activity 
measurements may underestimate the total EGZ production. Typically, polar inclusion 
bodies were smaller in host cells also having constructs encoding the out secretory 
proteins, which allow for increased secretion of proteins from the periplasmic space. As 
expected, no periplasmic inclusion bodies were evident in the negative control strain B 
(pUC19) which does not produce glucanase. 
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EXAMPLE 2 

Recombinant Klebsiella Hosts Suitable for Fermenting Oligosaccharides into 

Ethanol 

In this example, a recombinant Klebsiella host, suitable for use as a biocatalyst 
for depolymerizing and fermenting oligosaccharides into ethanol, is described. 

Throughout this example, the following materials and methods are used unless 
otherwise stated. 
Materials and Methods 
Bacteria, Plasmids, and Culture Conditions 

The strains and plasmids that were used in this exemplification are summarized 
in Table 4 below. 



TABLE 4. Strains and Plasmids Used 



Strains/Plasm ids 


Properties 


Sou rces/Ref eren ces 


Strains 


Zymomonas mobilis 


Prototrophic 


Ingram et al. (1988) Appl 


CP4 




Environ. Micro. 54:397-404 


Escherichia coli 


DH5ot 


lacZ Ml 5 recA 


Bethesda Research Laboratory 


HB101 


recA lacY recA 


ATCC 37159 


Klebsiella oxytoca 


M5A1 


Prototrophic 


Wood et al (1992) Appl 






Environ. Micro. 58:2103-2110 


P2 


Pflrpdc adhB cat 


Wood et al. (1992) Appl. 




Environ. Micro. 58:2103-2110 


SZl 


pflrpdc adhB cat; integrated celZ\ tet 


See text 


SZ2 


pflr.pdc adhB cat; integrated celZ; tet 


See text 


SZ3 


pfl::pdc adhB cat, integrated celZ; tet 


See text 


SZ4 


pflr.pdc adhB cat; integrated celZ; tet 


See text 


SZ5 


pflrpdc adhB cat; integrated celZ; tet 


See text 


SZ6 


pflr.pdc adhB cat; integrated celZ; tet 


See text 


SZ7 


pflr.pdc adhB cat; integrated celZ; tet 


See text 


SZ8 


pflrpdc adhB cat; integrated celZ; tet 


See text 


SZ9 


pflrpdc adhB cat; integrated celZ; tet 


See text 


SZ10 


pflrpdc adhB cat; integrated celZ; tet 


See text 


Plasmids 






pUC19 


bla cloning vector 


New England Bio labs 


pBR322 


bla tet cloning vector 


New England Biolabs 


pLOI1620 


bla celZ 


Wood et al. (1997) Biotech. 




Bioeng. 55:547-555 


pRK2013 


kan mobilizing helper plasmid (mob + ) 


ATCC 


pCPP2006 


Sp r ? 40 kbp fragment containing out genes 


He etal. (1991) P.KA.S. 




from E. chrysanthemi EC 16 


88:1079-1083 


pST76-K 


kan low copy vector containing temperature 


Posfai et al (1997) J. Bact. 


sensitive pSClOl replicon 


179:4426-4428 


pLOI2164 


bla celZ (BamHl eliminated from 


See text 




pLOI1620) 




pLOI2173 


kan celZ (native celZ promoter) 


See text 
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TABLE 4. Strains and Plasmids Used (Continued) 



Strains/Plasm ids 


Properties 


Sources/References 


Plasmids (Continued 1 * 






pLOI2177 


kan celZ (surrogate promoter from Z 
mobilis) 


See text 


pLOI2178 


kan celZ (surrogate promoter from Z. 
mobilis) 


See text 


pLOI2179 


kan celZ (surrogate promoter from Z 
mobilis) 


See text 


DLOI2180 


kan celZ ( surroeate promoter from Z 


See text 


DLOI2181 


ce/Z f surrogate promoter from Z. 
mobilis) 


See text 


0LOI2 1 82 


kan celZ (surrogate oromoter from Z 
mobilis) 


See text 


nLOI2 1 83 


kan celZ ( surrogate oromoter from Z 
mobilis) 


See text 


dLOI2 1 84 


kan celZ ( surrogate oromoter from Z 
mobilis) 


See text 


St ra i n ^APl a <i in irl <i 

K 1 %\ 111 3/ J. 1(131H 1UI3 


Prnnerties 


Sources/References 


pLOI2185 


Az?w celZ (surrogate promoter from Z 

mobilises 


See text 


pLOI2186 


kan celZ (surrogate promoter from Z 
mohiliv\ 


See text 


pLOI2187 


ce/Z (surrogate promoter from Z 

mnhilis:} 


See text 


pLOI2188 


kan celZ (surrogate promoter from Z 
mobilis) 

fit- ^-f U It § 


See text 


pLOI2189 


kan celZ (surrogate promoter from Z 
mohilis\ 


See text 


d t OI2 190 


kan celZ f surrogate oromoter from Z 
mobilis) 


See text 


DLOI2191 


kan celZ ( surrosate Dromoter from Z 


See text 


pLOI2192 


kan celZ (surrogate promoter from Z 
mobilis) 


See text 


pLOI2193 


kan celZ (surrogate promoter from Z. 
mobilis) 


See text 


DLOI2194 


kan celZ (surrogate promoter from Z 
mobilis) 


See text 


pLOI2301 


Ascl linker inserted into Ndel site of pUC 1 9 


See text 


pLOI2302 


/IscI linker inserted into Sap\ site of 
pLOI2301 


See text 


pLOI2303 


Aval-EcoBl fragment from pBR322 
inserted into Pstl site of pLOI2302 after 
Klenow treatment 


See text 


pLOI2305 


EcoKL DNA fragment of K. oxytoca M5A1 
genomic DNA (ca. 2.5 kb) cloned into the 
Smal site of pLOI2303 


See text 


pLOI2306 


EcoRI-Sphl fragment from pLOI2 183 
cloned into EcoRA site of pLOI2305 


See text 
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The culture conditions used for cultivating E. coli and K. oxytoca M5A1 
typically employed Luria-Bertani broth (LB) containing per liter: 10 g Difco® tryptone, 
5 g yeast extract, and 5 g sodium chloride, or, alternatively, Luria agar (LB 
supplemented with 15 g of agar) (Sambrook et al } (1989), Molecular Cloning: A 
5 Laboratory Manual, C.S.H.L., Cold Spring Harbor, N. Y.). 

For screening bacterial colonies under selective conditions, CMC -plates (Luria 
agar plates containing 3 g L" 1 carboxymethyl cellulose) were used to determine levels of 
glucanase activity expressed by a given bacterial strain (Wood et al (1988) Enzymology, 
160:87-1 12). For cultivating ethanologenic strains, glucose was added to solid media 

10 (20 g L' 1 ) and broth (50 g L" 1 ). In determining glucanase activity, the glucose in the 
growth media was replaced with sorbitol (50 g L " 1 ), a non-reducing sugar. For 
cultivating various strains or cultures in preparation for introducing nucleic acids by 
electroporation, a modified SOC medium was used (e.g., 20 g L _1 Difco® tryptone, 5 g 
L' 1 , Difco® yeast extract, 10 mM NaCl, 2.5 mM KC1, 10 mM MgS0 4 , 10 mM MgCl 2 , 

15 and 50 g L" 1 glucose). The antibiotics ampicillin (50 mg L" 1 ), spectinomycin (100 mg 
L" 1 ), kanamycin (50 mg L" 1 ), tetracycline (6 or 12 mg L" 1 ), and chloramphenicol (40, 
200, or 600 mg L" 1 ) were added when appropriate for selection of recombinant hosts 
bearing antibiotic resistance markers. Unless stated otherwise, cultures were grown at 
37° C. Ethanologenic strains and strains containing plasmids with a 

20 temperature-sensitive pSClOl replicon were grown at 30° C. 

Genetic Methods 

For plasmid construction, cloning, and transformations, standard methods and E. 
coli DH5a hosts were used (Ausubel et al. (1987) Current Protocols in Molecular 

25 Biology. John Wiley & Sons, Inc.; Sambrook et al, (1989) Molecular Cloning: A 
Laboratory Manual, C.S.H.L., Cold Spring Harbor, N.Y.). Construction of the celZ 
integration vector, pLOI2306, was performed as shown in Figure 7. A circular DNA 
fragment lacking a replicon from pLOI2306 (see Figure 7) was electroporated into the 
ethanologenic K. oxytoca P2 using a Bio-Rad Gene Pulser using the following 

30 conditions: 2.5 kV and 25 (iF with a measured time constant of 3.8-4.0 msec 

(Comaduran et al (1998) Biotechnol. Lett. 20:489-493). The E. chrysanthemi EC 16 
secretion system (pCPP2006) was conjugated into K. oxytoca using pRK2013 for 
mobilization (Murata et al (1990) J. Bacteriol 172:2970-2978). Small scale and large 
scale plasmid isolations were performed using the TELT procedure and a Promega 

35 Wizard Kit, respectively. DNA fragments were isolated from gels using a Qiaquick® 
Gel Extraction Kit from Qiagen® (Qiagen Inc., Chatsworth, CA). Chromosomal DNA 
from K. oxytoca M5 Al and Z mobilis CP4 were isolated as described by Cutting and 
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Yomano (see Example 1). The DNAs of interest were sequenced using a LI-COR 
Model 4000-L DNA sequencer (Wood et al (1997) Biotech Bioeng. 55:547-555). 

Chromosomal Integration of celZ 
5 Two approaches were employed for chromosomal integration of celZ, using 

selection with a temperature-conditional plasmid (pLOI2183) using a procedure 
previously described for E. coli (Hamilton et ah, (1989)*/ Bacteriol 171:4617-4622) 
and direct integration of circular DNA fragments lacking a functional replicon. This 
same method was employed for chromosomal integration of Z. mobilis genes encoding 

10 the ethanol pathway in E. coli B (Ohta K et al, (1991) Appl. Environ. Microbiol. 
57:893-900) and A: oxytoca M5 Al (Wood et al (1992) Appl Environ. Microbiol 
58:2103-21 10). Typically, circular DNA was transformed into P2 by electroporation 
using a Bio-Rad Gene Pulser. Next, transformants were selected on solid medium 
containing tetracycline (6 mg L" 1 ) and grown on CMC plates to determine levels of 

15 glucanase activity. 

Glucanase Activity 

Glucanase activity resulting from expression of celZ gene product {i.e., 
glucanase) under the control of different test promoters was evaluated by staining CMC 

20 plates as described in Example 1 . This colorimetric assay results in yellow zones 

indicating glucanase activity and the diameter of the zone was used as a relative measure 
of celZ polypeptide expression. Clones that exhibited the largest zones of yellow color 
were further evaluated for glucanase activity at 35° C using carboxymethyl cellulose as 
the substrate (20 g L" 1 dissolved in 50 mM citrate buffer, pH 5.2) (Wood et al (1988) 

25 Methods in Enzymology 160: 87-1 12). In order to measure the amount of intracellular 
glucanase, enzymatic activity was released from cultures by treatment with ultra-sound 
for 4 seconds (Model W-290F cell disruptor, Heat System-Ultrasonics Inc., Plainview, 
NY). The amount of glucanase activity expressed was measured and is presented here 
as jamol of reducing sugar released per min (IU). Reducing sugar was measured as 

30 described by Wood (Wood et al (1988) Methods in Enzymology 160: 87-1 12) using a 
glucose standard. 



Substrate Depolymerization 

To further determine the amount of glucanase activity produced by various host 
35 cells, different carbohydrate substrates (20 g L" 1 suspended in 50 mM citrate buffer, pH 
5.2) were incubated with various cell extracts. In one example, test substrates 
comprising acid-swollen and ball-milled cellulose were prepared as described by Wood 
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5 f'i 



(pas. 



(Wood et al. (1988) Methods in Enzymology 160: 87-1 12). A typical polysaccharase 
extract (i.e., EGZ (glucanase) from Al oxytoca SZ6 (pCPP2006)) was prepared by 
cultivating the host cells at 30°C for 16 h in LB supplemented with sorbitol, a 
nonreducing sugar. Dilutions of cell-free broth were added to substrates and incubated 
5 at 35°C for 16 h. Several drops of chloroform were added to prevent the growth of 
adventitious contaminants during incubation. Samples were removed before and after 
incubation to measure reducing sugars by the DNS method (see, Wood et al (1988) 
Methods in Enzymology 160: 87-1 12). The degree of polymerization (DP) was 
estimated by dividing the total calculated sugar residues present in the polymer by the 
10 number of reducing ends. 

Fermentation Conditions 

Fermentations were carried out in 250 ml flasks containing 100 ml of Luria broth 
supplemented with 50 g L" 1 of carbohydrate. Test carbohydrates were sterilized 

1 5 separately and added after cooling. To minimize substrate changes, acid-swollen 
cellulose, ball-milled cellulose and xylan were not autoclaved. The antibiotic 
chloramphenicol (200 mg L' 1 ) was added to prevent the growth of contaminating 
organisms. Flasks were inoculated (10% v/v) with 24-h broth cultures (50 g L' 1 
glucose) and incubated at 35° C with agitation (100 rpm) for 24-96 h. To monitor 

20 cultures, samples were removed daily to determine the ethanol concentrations by gas 
chromatography (Dombekera/. (1986) Appl Environ. Microbiol 52:975-981). 

Methods for Isolating and Identifying a Surrogate Promoter 

In order to identify random fragments of Z mobilis that would serve as surrogate 
25 promoters for the expression of heterologous genes in Klebsiella and other host cells, a 
vector for the efficient cloning of candidate promoters was constructed as described in 
Example 1 (see also, Ingram et al (1988) Appl Environ. Microbiol 54:397-404). 

Next, &zu3AI digested Z mobilis DNA fragments were ligated into the BamRl 
site of pLOI2171 to generate a library of potential promoters. These plasmids were 
30 transformed into E. coli DH5a for initial screening. Of the 18,000 colonies individually 
tested on CMC plates, 75 clones produced larger yellow zones than the control 
(pLOI2173). Plasmids from these 75 clones were then transformed into K. oxytoca 
M5A1, re-tested, and found to express high levels of celZ in this second host. 
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Recombinant Klebsiella Hosts for Producing Polysaccharase 

The high expressing clones (pLOI2177 to pLOI2194) with the largest zones on 
CMC plates indicating celZ expression were grown in LB broth and assayed for 
glucanase activity (Table 5). 



TABLE 5. Evaluation of promoters for ce/Z expression and secretion in K. oxytoca M5A1 



Plasm ids 3 


No secretion genes 


Secretion genes present (pCPP2006) 


Total activity 
(IU L'') b 


Secreted activity 
(IU L" 1 ) 


Total activity 
(IU L _1 ) 


Secreted activity 
(IU L' 1 ) 


PLOI2 1 73 


2,450 


465 


3,190 


1,530 


PLOI2177 


19,700 


3,150 


32,500 


13,300 


PLOI2178 


15,500 


2,320 


21,300 


11,500 


PLOI2 1 79 


15,400 


2,310 


21,400 


12,000 


PLOI2180 


21,400 


3,210 


30,800 


13,600 


PLOI2181 


15,600 


2,490 


21,000 


11,800 


PLOI2182 


19,600 


3,130 


31,100 


14,000 


PLOI2183 


20,700 


3,320 


32,000 


14,000 


PLOI2184 


15,500 


2,480 


21,200 


11,900 


PLOI2185 


15,100 


2,420 


24,600 


11,500 


PLOI2186 


17,000 


2,380 


25,700 


13,400 


PLOI2187 


15,800 


2,210 


24,500 


12,200 


PLOI2188 


18,200 


2,180 


25,600 


12,000 


PLOI2189 


14,800 


2,360 


27,100 


12,700 


PLOI2190 


16,100 


2,410 


26,500 


12,500 


PLOI2191 


15,800 


2,210 


25,000 


12,400 


PLOI2192 


15,100 


1,810 


24,900 


12,500 


PLOI2193 


16,700 


2,010 


24,600 


12,800 


PLOI2194 


15,400 


2,770 


21,500 


11,900 



a pLOI2173 contains the celZ gene with the original promoter, all others contain the celZ gene with a Z 

mob His DNA fragment which serves as a surrogate promoter. 

b Glucanase (CMCase) activities were determined after 16 h of growth at 30°C. 



Activities with these plasmids were up to 8-foid higher than with the control 
plasmid containing the native celZ promoter (pLOI2 1 73). The four plasmids which 
produced the largest zones (pLOI2177 9 pLOI2180, pLOI2182 and pLOI2183) also 
produced the highest total glucanase activities (approximately 30,000 IU L" 1 ). One of 
these plasmids, pLOI2183, was selected for chromosomal integration. 

Chromosomal Integration of a Polysaccharase Gene 

To stably incorporate a desirable polysaccharase gene into a suitable host cell, 
e.g., Klebsiella P2 strain, a novel vector (pLOI2306) was constructed to facilitate the 
isolation of a DNA fragment which lacked all replication functions but contained the 
celZ gene with surrogate promoter, a selectable marker, and a homologous DNA 
fragment for integration (Figure 7). Two AscI sites were added to pUC19 by inserting a 
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linker (GGCGCGCC; SEQ ID NO: 1 1) into Klenow-treated Ndel and Sapl sites which 
flank the polylinker region to produce pLOI2302. A blunt fragment containing the tet 
resistance marker gene from pBR322 (excised with EcoRl and Aval followed by 
Klenow treatment) was cloned into the Pstl site of pLOI2302 (cut with Pstl, followed by 
5 Klenow treatment) to produce pLOI2303. To this plasmid was ligated a blunt fragment 
of K. oxytoca M5A1 chromosomal DNA (cut with EcoRl and made blunt with Klenow 
treatment) into the Smal site of pLOI2303 to produce (pLOI2305). The EcoRl - Sph\ 
fragment (Klenow treated) containing the surrogate Z. mobilis promoter and celZ gene 
from pLOI2183 was ligated into the EcoKl site of pLOI2305 (EcoW, Klenow treatment) 

10 to produce pLOI2306. Digestion of pLOI2306 with Ascl produced two fragments, the 
larger of which contained the celZ gene with a surrogate promoter, tet gene, and 
chromosomal DNA fragment for homologous recombination. This larger fragment (10 
kbp) was purified by agarose gel electrophoresis, circularized by self-ligation, and 
electroporated into the Klebsiella strain P2 and subsequently grown under selection for 

1 5 tetracycline resistance. The resulting 2 1 tetracycline-resistant colonies were purified 
and tested on CMC plates for glucanase activity. All were positive with large zones 
indicating functional expression of the celZ gene product. 

Clones used to produce the recombinant strains were tested for the presence of 
unwanted plasmids by transforming DH5a with plasmid DNA preparations and by gel 

20 electrophoresis. No transformants were obtained with 12 clones tested. However, two 
of these strains were subsequently found to contain large plasmid bands which may 
contain celZ and these were discarded. Both strains with large plasmids contained 
DNA, which could be sequenced with T7 and Ml 3 primers confirming the presence of 
multicopy plasmids. The remaining ten strains contain integrated celZ genes and could 

25 not be sequenced with either primer. 

The structural features of the novel vector pLOI2306 are schematically shown in 
Fig. 8 and the nucleotide sequence of the vector, including various coding regions {i.e., 
of the genes celZ, bla, and tet), are indicated in SEQ ID NO: 12 of the sequence listing. 
Nucleotide base pairs 3282-4281, which represent non-coding sequence downstream of 

30 the celZ gene (obtained from E. chrysanthemi), and base pairs 9476-1 1544 which 

represent a portion of the non-coding target sequence obtained from K. oxytoca M5A1, 
remain to be sequenced using standard techniques {e.g., as described in Sambrook, J. et 
al, T. Molecular Cloning: A Laboratory Manual. 2nd, ed, Cold Spring Harbor 
Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, (1989); 

35 Current Protocols in Molecular Biology, eds. Ausubel et al, John Wiley & Sons 

(1992)). For example, sufficient flanking sequence on either side of the aforementioned 
unsequenced regions of the pLOI2306 plasmid is provided such that sequencing primers 
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that correspond to these known sequences can be synthesized and used to carry out 
standard sequencing reactions using the pLOI2306 plasmid as a template. 

Alternatively, it will be understood by the skilled artisan that these unsequenced 
regions can also be determined even in the absence of the pLOI2306 plasmid for use as 
5 a template. For example, the remaining celZ sequence can be determined by using the 
sequence provided herein (e.g., nucleotides 1452-2735 of SEQ ID NO: 12) for 
synthesizing probes and primers for, respectively, isolating a celZ containing clone from 
a library comprising E. chrysanthemi sequences and sequencing the isolated clone using 
a standard DNA sequencing reaction. Similarly, the remaining target sequence can be 

10 determined by using the sequence provided herein (e.g., nucleotides 8426-9475 of SEQ 
ID NO: 12) for synthesizing probes and primers for, respectively, isolating a clone 
containing target sequence from a library comprising K. oxytoca M5A1 EcoRl 
fragments (e.g., of the appropriate size) and sequencing the isolated clone using a 
standard DNA sequencing reaction (a source of K. oxytoca M5A1 would be, e.g., ATCC 

15 68564 cured free of any plasmid using standard techniques). The skilled artisan will 
further recognize that the making of libraries representative of the cDNA or genomic 
sequences of a bacterium and the isolation of a desired nucleic acid fragment from such 
a library (e.g. , a cDNA or genomic library), are well known in the art and are typically 
carried out using, e.g., hybridization techniques or the polymerase chain reaction (PCR) 

20 and all of these techniques are standard in the art (see, e.g. , Sambrook, J. et al. , T. 

Molecular Cloning: A Laboratory Manual. 2nd, ed., Cold Spring Harbor Laboratory, 
Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, (1989); Current 
Protocols in Molecular Biology, eds. Ausubel et al, John Wiley & Sons (1992); 
Oligonucleotide Synthesis (MJ. Gait, Ed. 1984); and PCR Handbook Current Protocols 

25 in Nucleic Acid Chemistry, Beaucage, Ed. John Wiley & Sons (1999) (Editor)). 

Heterologous Gene Expression Using a Surrogate Promoter and Integrated or 
Plasmid-Based Constructs 

The ten integrated strains (SZ1-SZ10) were investigated for glucanase 
30 production in LB sorbitol broth (Table 6). All produced 5,000-7,000 IUL" 1 of active 
enzyme. Although this represents twice the activity expressed from plasmid pLOI2173 
containing the native celZ promoter, the integrated strains produced only 1/3 
the glucanase activity achieved by P2 (pLOI2183) containing the same surrogate Z. 
mobilis promoter (Table 5). The reduction in glucanase expression upon integration 
35 may be attributed to a decrease in copy number (i.e. , multiple copy plasmid versus a 
single integrated copy). 
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Secretion of Glucanase EGZ 

K. oxytoca contains a native Type II secretion system for pullulanase secretion 
(Pugsley (1993) Microbiol Rev. 57:50-108), analogous to the secretion system encoded 
by the out genes in Erwinia chrysanthemi which secrete pectate lyases and glucanase 
5 (EGZ) (Barras et al. (1994) Annu. Rev. Phytopathol 32:201-234; He et al. (1991) Proc. 
Natl. Acad Set USA. 88: 1079- 1083). Type II secretion systems are typically very 
specific and function poorly with heterologous proteins (He et al. (1991) Proc. Natl 
Acad. Set USA. 88: 1079- 1083; ¥y et al (1991) FEMS Microbiol Lett. 79:315-322; 
Sauvonnet et al (1996) Mol. Microbiol 22: 1-7). Thus as expected, recombinant celZ 

10 was expressed primarily as a cell associated product with either M5A1 (Table 5) or P2 
(Table 6) as the host. About 1/4 (12-26%) of the total recombinant EGZ activity was 
recovered in the broth. With E. coli DH5a, about 8-12% of the total extracellular EGZ 
was present. Thus the native secretion system in K. oxytoca may facilitate partial 
secretion of recombinant EGZ. 

15 To further improve secretion of the desired products, type II secretion genes (out 

genes) from E. chrysanthemi EC16 were introduced {e.g., using pCPP2006) to facilitate 
secretion of the recombinant EGZ from strain P86021 in ethanologenic strains of K. 
oxytoca (Table 5 and Table 6). For most strains containing plasmids with celZ, addition 
of the out genes resulted in a 5-fold increase in extracellular EGZ and a 2-fold increase 

20 in total glucanase activity. For strains with integrated celZ, addition of the out genes 
resulted in a 10-fold increase in extracellular EGZ and a 4-fold increase in total 
glucanase activity. In both cases, the out genes facilitated secretion of approximately 
half the total glucanase activity. The increase in EGZ activity resulting from addition of 
the out genes may reflect improved folding of the secreted product in both plasmid and 

25 integrated celZ constructs. The smaller increase observed with the pUC-based 

derivatives may result from plasmid burden and competition for export machinery 
during the production of periplasmic p-lactamase from the bla gene on this high copy 
plasmid. 

Two criteria were used to identify the best integrated strains of P2, growth on 
30 solid medium containing high levels of chloramphenicol (a marker for high level 

expression of the upstream pdc and adhB genes) and effective secretion of glucanase 
with the out genes. Two recombinant strains were selected for further study, SZ2 and 
SZ6. Both produced 24,000 IU L_1 of glucanase activity, equivalent to approximately 
5% of the total cellular protein (Py et al (1991) Protein Engin. 4:325-333). 



35 
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Substrate Depolymerization 

The substrate depolymerization of the recombinant EGZ was determined to be 
excellent when applied to a CMC source (Table 7). When applied to acid swollen 
cellulose, the activity of the glucanase was less than 10% of the activity measured for 
CMC activity. Little activity was noted when the polysaccharase was applied to Avicel® 
or xylan. However, when allowed to digest overnight, the EGZ polysaccharase resulted 
in a measurable reduction in average polymer length for all substrates. CMC and 
acid-swollen cellulose were depolymerized to an average length of 7 sugar residues. 
These cellulose polymers of 7 residues are marginally soluble and, ideally, may be 
further digested for efficient metabolization (Wood et al (1992) Appl Environ. 
Microbiol 58:2103-21 10). The average chain length of ball-milled cellulose and 
Avicel® was reduced to 1/3 of the original length while less than a single cut was 
observed per xylan polymer. 

TABLE 6. Comparison of culture growth, glucanase production, and secretion from 
ethanologenic K. oxytoca strains containing integrated ce/Z 



Strains 



P2 



SZl 



SZ2 



SZ3 



SZ4 



SZ5 



SZ6 



SZ7 



SZ8 



SZ9 



SZ10 



Growth on 
solid medium 
(600 mg L 
'CM) 



++++ 



++++ 



+++ 



+++ 



+ 



++++ 



+ 



+++ 



+++ 



Glucanase production and secretion (IU L"') 



No secretion system 



Total activity 



0 



6,140 



6,460 



5,260 



7,120 



6,000 



7,620 



6,650 



7,120 



7,530 



4,940 



Secreted 
activity 



0 



1,600 



1,160 



1,320 



1,070 



1,080 



1,520 



1,330 



854 



1,130 



939 



Adding secretion system 
(pCPP2006) 



Total activity 



0 



26,100 



23,700 



18,400 



23,200 



29,300 



24,300 



28,800 



28,700 



26,700 



17,000 



Glucanase (CMCase) activities were determined after 16 h of growth at 30°C. 



Secreted 
activity 



0 



14,300 



11,400 



8,440 



9,990 



15,500 



1 1 ,900 



15,500 



14,900 



12,800 



6,600 



TABLE 7. Depolymerization of various substrates by EGZ from cell free broth of strain 
SZ6 (pCPP2006) 



Substrates 



Carboxymethyl cellulose 



Acid-Swollen cellulose 



Ball-milled cellulose 

ngj 



Avicel 



Xylan from oat spelts 



Enzyme 
activity (IU/L) 



13,175 



893 



200 



41 



157 



Estimated degree of polymerization 



Before digestion 



224 



87 



97 



104 



110 



After digestion 



28 



35 



78 



Strain SZ6 (pCPP2006) was grown in LB-sorbitol broth for 16 h as a source of secreted EGZ 
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Saccharification and Fermentation Ability of a Biocatalyst 

To be useful, addition of celZ and out genes to strain P2 must not reduce the 
fermentative ability of the resulting biocatalyst. A comparison was made using glucose 
and cellobiose (Table 8). All strains were equivalent in their ability to ferment these 
5 sugars indicating a lack of detrimental effects from the integration of celZ or addition of 
pCPP2006. These strains were also examined for their ability to convert acid-swollen 
cellulose directly into ethanol. The most active construct SZ6 (pCPP2006) produced a 
small amount of ethanol (3.9 g L _1 ) from amorphous cellulose. Approximately 1.5 g L" 1 
ethanol was present initially at the time of inoculation for all strains. This decreased 

10 with time to zero for all strains except SZ6 (pCPP2006). Thus the production of 3.9 g L 
1 ethanol observed with SZ6 (pCPP2006) may represent an underestimate of total 
ethanol production. However, at best, this represents conversion of only a fraction of 
the polymer present. It is likely that low levels of glucose, cellobiose, and cellotriose 
were produced by EGZ hydrolysis of acid swollen cellulose and fermented. These 

1 5 compounds can be metabolized by the native phosphoenolpyruvate-dependent 
phosphotransferase system in K. oxytoca (Ohta Ket al, (1991) Appl Environ. 
Microbiol 57:893-900; Wood et al (1992) Appl Environ. Microbiol 58:2103-2110). 

TABLE 8. Ethanol production by strain SZ6 containing out genes (pCPP2006) and 
20 integrated ce/Z using various substrates (50 g L" 1 ) 



Strains 


Ethanol production (g L"') 


Glucose 


Cellobiose 


Acid-swollen cellulose 


P2 


22.9 


22.7 


0 


P2 (pCPP2006) 


22.6 


21.3 


0 


SZ6 


21.5 


19.7 


0 


SZ6 (pCPP2006) 


22.7 


21.2 


3.9 



Initial ethanol concentrations at the time of inoculation were approximately 1 .5 g L" 1 for all cultures. With 
acid swollen cellulose as a substrate, these levels declined to 0 after 72 h of incubation for all strains 
25 except SZ6 (pCP206). 

EXAMPLE 3 

Synergistic Hydrolysis of Carboxymethyl Cellulose and Acid-swollen Cellulose by 
Two Endoglucanases (EGZ and EGY) from Erwinia chrysanthemi 

30 This example describes production of the endoglucanases EGY and EGZ by 

recombinant E. coli and the synergistic hydrolysis of carboxymethyl cellulose (CMC) 
and acid-swollen cellulose by these endoglucanases. 

Throughout this example, the following materials and methods are used unless 
otherwise stated. 



35 
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Materials and Methods 

Bacteria, plasmids and culture conditions 

Bacterial strains and plasmids used in this study are listed in Table 9. 



5 TABLE 9. Strains and plasmids used 





Strains / Plasmids 


Descriptions 


References / Sources 




Strains 








Escherichia coli 








DH5a 


lacZ Ml 5 recA 


Bethesda Research 
Laboratory 




B 


Prototrophic 


ATCC11303 




HB101 


recA lacY recA 


ATCC37159 




TOPI OF' 


This strain expresses the lac repressor 
(lacl q gene) from an F episome 


Invitrogen 




Plasmids 






III 


pCR2.1-TOPO 


TOPO IIVl TA Cloning vector, Ap r , Km 1 


Invitrogen 




pRK2013 


Km r mobilizing helper plasmid (mob^) 


ATCC 


a.; " 


pCPP2006 


Sp r , ca. 40 kbp plasmid carrying the 
complete out genes from E. chrysanthemi 
EC16 


He, etal (1991) Proc. 
Natl Acad Sci. USA 
88:1079-1083. 


™"c 


pLOI1620 


Ap r , celZ gene and its native promoter 
from E. chrysanthemi P8602 1 


Beall, etal (1993) J. 
Indus t. Microbiol 
11:151-155. 


= 


pMH18 


Ap r , celY gene and its native promoter 
from E. chrysanthemi 3937 


Guiseppi, et al (1991) 
Gene 106:109-114. 


jf* 


pLOI23 1 1 


celY gene (without native promoter), 
cloned into pCR2.1-TOPO vector and 
oriented for expression from the lac 
promoter 


See text 



Escherichia coli DH5a and TOPO10F' were used as hosts for plasmid constructions. 

The celZ gene was cloned from E. chrysanthemi P86021 (Beall, et al (1993) J. Indust. 
10 Microbiol 1 1:151-155). The ce/F gene was cloned by Guiseppi et al ((1991) Gene 

106: 109-1 14) from E. chrysanthemi 3937. The out genes were cloned by He et al 

((1991) Proc. Acad. Set USA 88:1079-1083) from E. chrysanthemi EC16. 

E. coli cultures were grown at 37°C in Luria-Bertani broth (LB) containing per 

liter: 10 g Difco tryptone, 5 g Difco® yeast extract, and 5 g sodium chloride or on solid 
15 LB medium containing agar (1.5%). Clones were screened for endoglucanase 

production using the Congo Red method (Wood, et al (1989) Biochem. J. 260:37-43). 

Indicator plates were prepared by supplementing LB agar with low viscosity CMC 

(0.3%). Ampicillin (50 |ag/ml), kanamycin (50 ng/ml) and spectinomycin (100 |ig/ml) 

were added as appropriate for selection. 

20 
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Genetic methods 

Standard methods were used for plasmid construction and analyses (Ausubel, et 
al (1987) Current Protocols in Molecular Biology. New York: John Wiley and Sons, 
Inc). The coding region for celY was amplified by the polymerase chain reaction using 
pMHl 8 as the template with the following primer pairs: N-terminus 
5'CTGTTCCGTTACCAACAC3 (SEQ ID NO: 13)', C-terminus 
5 ' GTGA ATGGGATC AC GAGT3 1 (SEQ ID NO: 14). The E. chrysanthemi out genes 
(pCPP2006) were transferred by conjugation using pRK2013 for mobilization (Zhou, et 
al (1999) B. Appl Environ. Microbiol 65:2439-2445). DNA was sequenced by the 
dideoxy method using a LI-COR Model 4000-L DNA sequencer and fluorescent 
primers. 

Enzyme assay 

Endoglucanase activity was determined in vitro using CMC as a substrate. 
Appropriate dilutions of cell-free culture broth (extracellular activity) or broth 
containing cells that had been disrupted by ultrasound (total activity) were assayed at 
35°C in 50 mM citrate buffer (pH 5.2) containing low viscosity CMC (20 g per liter). 
Reactions were terminated by heating in a boiling water bath for 1 0 min. Reducing 
sugars were measured using 3,5-dinitrosalicylic acid reagent with glucose as a standard 
(Wood, et al (1988) Methods Enzymology 160:87-1 12). Enzyme activity (CMCase) is 
expressed as pxnol reducing sugar released per min (IU). Results are an average of two 
or more determinations. 

Synergism 

Stationary phase cultures of DH5a(pLOI1620 + pCPP2006) and 
DH5a(pLOI23 11) were sonicated and centrifuged as described in Zhou, et al (1999), 
supra, as a source of EGZ and EGY, respectively. These were diluted as necessary to 
provide equal CMCase activities. Mixtures of EGZ and EGY were tested for synergy at 
35°C in 50 mM citrate buffer (pH5.2) containing CMC (20 g/L) or acid swollen 
cellulose (20 g per liter). For tests with Avicel® (20 g per liter), enzyme preparations 
were mixed without prior dilution. Hydrolyzed samples of acid-swollen cellulose and 
Avicel® were centrifuged (10,000 x g, 5 min) to remove insoluble material prior to the 
determination of reducing sugars. 

The effect of sequential additions of EGZ and EGY was also investigated. 
Substrates were hydrolyzed with a single enzyme for 4 hours and then inactivated by 
boiling for 20 minutes. After cooling, the second enzyme was added and incubated for 
an additional 4 hours. Control experiments were conducted with both enzymes together 
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(4 hours) and with each enzyme alone (4 hours). Samples were analyzed for reducing 
sugar. In some cases, products were also analyzed by thin layer chromatography. 

The degree of synergism for enzyme mixtures was calculated as the observed 
activity divided by the sum of predicted contributions from EGY alone and EGZ alone 
5 (Riedel, et al (1 997) FEMS Microbiol Lett. 147:239-243). 

Hydrolysis of cellooligosaccharides 

Hydrolysis products from cellobiose, cellotriose, cellotetraose, cellopentaose, 
acid-swollen cellulose (Wood, et al. (1988), supra) and Avicel® were analyzed by thin 

10 layer chromatography. For these analyses, 15 jxl of 1% substrate was mixed with 45 \i\ 
of crude enzyme (0.07 IU), incubated at 35°C for 2 hours and terminated by heating in a 
boiling water bath. Hydrolysates were spotted on Whatman 250 \im Silica-gel 150A 
plates and developed for approximately 4 hours using the solvent system described by 
Kim, (1995) Appl. Environ. Microbiol 61:959-965). By volume, this solvent contained 

15 6 parts chloroform, 7 parts acetic acid, and 1 part water. Sugars were visualized by 
spraying of 6.5 mM N-(l-naphthyl) ethylenediamine dihydrochloride and heating at 
100 C for approximately 10 min (Bounias, (1980) Anal Biochem. 106:291-295). 

Materials and chemicals 
20 Tryptone and yeast extract were products of Difco (Detroit, Michigan). 

Antibiotics, low viscosity CMC, cellobiose, cellotriose, and cellotetraose were obtained 
from the Sigma Chemical Company (St. Louis, Missouri). Cellopentaose was obtained 
from V-Lab (Covington, Louisiana). Avicel® was purchased from Fluka Chemika 
(Buchs, Switzerland). 

25 

Production of EGY and EGZ by recombinant E. coli 

Low levels of EGY activity were produced by native E. chrysanthemi 3937 and 
by recombinant E. coli harboring plasmid pMH18 (Boyer, et al ((1987) Eur. J. 
Biochem. 1 62:3 1 1 -3 1 6; Guiseppi, et al , supra). Poor expression from the high copy 

30 plasmid in E. coli was attributed to promoter function and a putative requirement for a 
celY activator protein (Guiseppi, et al, supra). A new clone was constructed to produce 
higher levels of EGY for our investigations of synergy. The EGY coding region 
(without promoter) was amplified using the polymerase chain reaction and cloned 
behind the lac promoter in pCR2. 1-TOPO. The resulting plasmid, pLOI23 1 1 , was 

35 strongly positive on CMCase indicator plates. Replacement of the native promoter with 
the lac promoter increased celY expression by approximately 10-fold, from 165 IU/L to 
1800 IU/L (See Table 10, below). 
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TABLE 10. Effect of E. chrysanthemi out genes on the expression and secretion of ceiY 



and ce/Z in £. coli DH5a 

5 





No out genes 


out genes present ( pCCF 


'2006) 


Enzyme 
expressed 


Promoter 


Growth 
(hr) 


Extracellular 

CMCase a 
(IU per liter) 


Total 
CMCase 
(IU per liter) 


Apparent 
secretion 

(%) 


Extracellular 

CMCase 3 
(IU per liter) 


Total 
CMCase 
(IU per 
liter) 


Apparent 
secretion 

(%) 


EGY 


Native promoter 
(pMH18) 


24 


136 


165 


82 


136 


180 


76 


lac promoter 
(pLOI2311) 


8 


208 


266 


78 


nd b 


Nd 


nd 


16 


1,420 


1,590 


90 


nd 


Nd 


nd 


24 


1,650 


1,800 


90 


1,360 


1,510 


90 


EGZ 


Native plus 
lac promoter 
(pLOI1620) 


8 


130 


1,320 


10 


6,710 


7,4600 


90 


16 


1,200 


9,030 


13 


13,400 


19,700 


68 


24 


1,800 


12,500 


14 


23,600 


36,800 


64 



a Secreted or released CMCase activity in the culture supernatant. 
b Abbreviation: nd, not determined. 

10 

Approximately 90% of EGY activity was found in the extracellular milieu. 
Expression of celZ was included for comparison (See Table 10). High levels of EGZ 
were produced by E. coli harboring plasmid pLOI1620. Extracellular EGZ and total 
EGZ activity were further increased by addition of the E. chrysanthemi out genes 

1 5 (pCPP2006) as reported previously (Zhou, et al (1 999) Appi Environ. Microbiol 

65:2439-2445). Unlike EGZ, however, EGY activity was not affected by the presence 
of out genes. Maximal EGY and EGZ activities were obtained from 24-hour cultures. 
The supernatants from disrupted cultures of DH5a containing pLOI231 1 or pLOI1620 
and pCPP2006 {out genes) were used as a source of EGY and EGZ, respectively, for 

20 further investigations. 

Synergistic action of EGY and EGZ with CMC as a substrate 

Initial studies examining the combined actions of EGY and EGZ were conducted 
with CMC (20 g per liter) for a single incubation time (Figure 9). Disrupted cell 

25 preparations containing EGY and EGZ were each diluted to equal activities (CMCase) 
and combined in different proportions to maintain a constant sum of individual 
activities. EGY and EGZ were tested individually as controls. All mixtures of EGY and 
EGZ were significantly higher than either enzyme assayed alone indicating a synergistic 
interaction. The synergistic effect increased with the proportion of EGZ. Maximal 

30 synergy (1.42) was observed with a ratio of EGZ to EGY activities of 9 to 1 and 19 tol . 
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Further experiments examined the effect of incubation time using CMC as the 
substrate and an activity ratio of 9 to 1 for EGZ and EGY, respectively (Figure 9). EGZ 
and EGY alone were included as controls. The synergistic effect of combining EGZ and 
EGY was clearly evident as an increase in the rate and extent of hydrolysis. Calculated 
5 synergy increased with incubation time. At the end of the incubation (4 hours), the 
concentration of reducing sugars was 1 .8-fold higher in the mixed enzyme preparation 
than predicted by the arithmetic sum of individual EGZ and EGY activities, i.e., 
synergistic activity. 

10 Effect of substrate (CMC) concentration on synergy 

The effect of substrate concentration on the synergy between EGZ and EGY was 
also studied (See Table 11, below). Increasing the CMC concentration from 2.5 g per 
liter to 20 g per liter increased the observed synergy from 1.12 to 1.89. Based on the 
specific activities of EGZ and EGY and a maximal synergism of 1.89, the enzyme 

1 5 turnover rate for the combination was 8-fold that of purified EGY alone, and 1 .5-fold 
that of purified EGZ alone. 

TABLE 11. Effect of substrate concentration on synergy 



CMC Substrate 
(g/L) 


Reducing sugar released (umole/ml) a 


Synergy 3 ' 0 


EGZ (10) D 


EGY(10)° 


EGZ(9) + EGY(1)° 


20 


3.98 ±0.04 
4.53 ±0.01 


3.83 ±0.04 
2.91 ±0.07 


7.51 ± 0.07 
5.38 ±0.04 


1.89 ±0.02 


10 


1.25 ±0.01 


5.0 


2.87 ±0.01 


1.18 ±0.04 


2.92 ± 0.04 


1.08 ±0.02 


2.5 


1.42 ±0.01 


0.50 ± 0.04 


1.49 ±0.01 


1.12±0.01 



20 

a Average ± standard deviation. 

b EGZ and EGY were diluted to equal CMCase activities. Reactions (0.15 IU/ml) contained 9 
parts of EGZ and 1 part of EGY. As controls, EGZ (0.15 IU/ml) and EGY (0.15 IU/ml) were 
each tested individually. 
25 c Synergy was calculated as the observed activity divided by the sum of predicted 
contributions from EGY alone (10%) plus EGZ alone (90%). 

EGY was more sensitive to substrate concentration than EGZ. Increasing the 
CMC concentration resulted in an 8 -fold increase in reducing sugar products with EGY 
30 but only a 3-fold increase with EGZ. Based on a double reciprocal plot of the data in 
Table 11, apparent Km values of 104, 12, and 38 g per liter were estimated for EGY, 
EGZ and the combination of both enzymes (9 parts EGZ + 1 part EGY), respectively. 
The higher apparent Km for EGY is consistent with a requirement for longer substrate 
molecules. 
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The extent of CMC hydrolysis was also examined by determining the 
approximate size of hydrolysis products. CMC (1.25 g per liter) was incubated (4 hours, 
0.75 IU CMCase/ml) with EGY 5 EGZ, and a combination of both enzymes (9 parts EGZ 
plus 1 part EGY). Chain length was estimated based on the reducing sugar assay before 
5 (250 glucosyl units) and after incubation. The average chain length was substantially 
reduced by all three enzyme preparations. EGZ was more effective in reducing chain 
length than EGY, 3.6 versus 10.7 glucosyl residues, respectively. The combination of 
both enzymes resulted in a synergistic action. Simultaneous hydrolysis with both 
enzymes reduced the average size of the hydrolysis products to 2.3 glucosyl residues, 
10 36% lower than EGZ alone and 79% lower than EGY alone. These results confirm that 
EGZ readily hydrolyzes both large CMC polymers and smaller saccharides. The action 
of EGY appears more limited, primarily hydrolyzing large polymers with greater than 
1 0 glucosyl units. 



1 5 Sequential and simultaneous hydrolysis of CMC with EGZ and EGY 



The mechanism of synergistic action between EGZ and EGY was further 
investigated by comparing the effects of sequential hydrolysis with individual 
endoglucanases to that of simultaneous hydrolysis by a mixture of both enzymes (See 
Table 12, below). Again synergy was observed for the simultaneous actions of both 

20 enzymes. No synergy was observed for the sequential hydrolysis of CMC when EGZ 
was used as the first enzyme, followed by digestion with EGY (after heat-inactivation of 
EGZ). In contrast, full synergy was retained when CMC was first treated with EGY, 
followed by EGZ (after heat-inactivation of EGY). These results indicate that synergy 
can be achieved by the independent activities of EGY and EGZ. Enzymatic 

25 modification of the substrate by EGY increased the rate and extent of subsequent 

hydrolysis by EGZ. These results provide further evidence that EGY and EGZ function 
quite differently. EGY appears to primarily reduce the chain length of large polymers 
while EGZ appears to act more randomly, hydrolyzing both large and small substrate 
molecules. 
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TABLE 12. Sequential and simultaneous hydrolysis of CMC by EGZ and EGY 



il( ilij y III V VlallTV LI 1 l_l | J V I 1 


lVff*$)<£iif*f*f1 rpHiirincr ^iioar 

ITlVuSUl I (/U UVlllg UUgAl 

released (^imole/ml) b 


PrerHotpfl artivitv 

from the arithmetic 
sum of EGY and 
EGZ 
(fimoIe/mI) c 




EGZ (10) + EGY (0) 


4.65 ± 0.08 


4.65 


1.00 ±0.02 


EGZ (0) + EGY (10) 


4.14 ±0.04 


4.14 


1.00 ±0.01 


EGZ (9) + EGY (1). 
(simultaneously) 


8.28 ±0.08 


4.60 


1.80 ±0.02 


EGZ (9), then EGY (1). (sequential) 


4.86 ±0.23 


4.60 


1.06 ±0.05 


EGY (1), then EGZ (9). (sequential) 


8.75 ±0.14 


4.60 


1.90 ±0.03 



a EGZ and EGY were diluted to equal CMCase activities. Both simultaneous and sequential 
5 hydrolysis reactions (0.15 IU/ml) were investigated using 9 parts of EGZ and 1 part of EGY. 
In the sequential hydrolysis experiments, the first enzyme was incubated with substrate for 
4 hours and inactivated by boiling for 20 min. After cooling, the second enzyme was added and 
incubated for an additional 4 hours. All reactions were terminated by boiling. 
b Average ± standard deviation (3 experiments). 
10 c Calculated sum of individual EGY and EGZ activities. 

d Synergy was calculated as the observed activity divided by the sum of predicted 
contributions from EGY alone (10%) plus EGZ alone (90%). 

Synergistic action on acid-swollen and crystalline cellulose 

1 5 Potential synergy was investigated using acid-swollen cellulose as the substrate 

and a 9 tol ratio of EGZ:EGY based on CMCase activities (Figure 9). Since the 
activities of EGZ and EGY with acid- swollen cellulose are lower than those with CMC 
(Boyer, et ah (1987) Eur. J. Biochem. 162:31 1-316), enzyme loading (1.5 IU) and 
incubation times were increased. When assayed individually with acid-swollen 

20 cellulose, EGY was approximately 1/3 as active as EGZ. However, the combination of 
these two enzymes was significantly more active than the predicted arithmetic sum of 
individual activities at all time points. The degree of synergy was essentially constant 
(1 .36 ± 0. 1 7) during the 36 hour period -of incubation. 

The hydrolysis products from acid-swollen cellulose (6 hours) were analyzed by 

25 thin layer chromatography (Figure 1 0). No soluble saccharides were observed after 

incubation with EGY alone. Cellobiose and cellotriose were the primary products from 
hydrolysis with EGZ alone and a combination of EGY and EGZ. With the combination 
of both enzymes, higher product levels were evident as darker and larger spots 
confirming a synergistic action. 

30 The synergistic action of EGZ and EGY was also investigated with Avicel® 

(Figure 10), a highly crystalline cellulose. Small amounts of cellobiose and cellotriose 
were observed as hydrolysis products with EGZ alone and with the mixture of EGY and 
EGZ. Due to low activity with Avicel®, high loadings (10 |al) were required on thin 
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layer plates to visualize products. Note that this additional salt increased the relative 
migration of oligosaccharide products in comparison to the standards. No 
cellooligosaccharide spots were observed with EGY alone. Again synergism was 
evident with the combination of EGY and EGZ. Larger and more intense spots were 
observed corresponding to cellobiose, cellotriose, and cellotetraose with the combined 
activities than with EGZ alone. The low activity with Avicel® as a substrate and the 
relatively low levels of products are consistent with the hydrolysis of the amorphous 
rather than the crystalline regions of Avicel®. These results indicate that the synergistic 
action of EGZ and EGY is not limited to a model substrate such as CMC. Synergistic 
hydrolysis was also observed for acid-swollen cellulose and the amorphous regions of 



Hydrolysis of cellooligosaccharides 

The substrate specificities of EGZ and EGY were further investigated using 
soluble cellooligosaccharides (cellobiose, cellotriose, cellotetraose, and cellopentaose). 
Hydrolysis products were analyzed by thin layer chromatography (Figure 1 1). 
Cellobiose was not hydrolyzed by EGY, EGZ or a combination of both enzymes. None 
of the cellooligosaccharides was hydrolyzed by EGY alone (Figure 1 1 , Panel B). In 
contrast, EGZ hydrolyzed cellotetraose and cellopentaose but not cellotriose (Figure 1 1 , 
Panel C). EGZ hydrolysis products from cellotetraose were primarily cellobiose with 
lesser amounts of cellotriose and glucose. With cellopentaose as the substrate, EGZ 
produced approximately equal amounts of cellobiose and cellotriose indicating a 
preferential attack on the second or third glycosidic bond. This was further confirmed 
by examining samples at various times during the incubation of cellopentaose with EGZ 
(Figure 1 1, Panel D). Cellobiose and cellotriose progressively accumulated during 
incubation with a corresponding reduction in cellopentaose. Thus in contrast to the 
requirement for large substrates by EGY, EGZ hydrolyzes soluble cellooligosaccharides 
containing four or more glucosyl units. 



EXAMPLE 4 

Integration, Expression, and Extracellular Secretion of Erwinia chrysanthemi 
Endoglucanase EGY (celY) and EGZ (celY) in Ethanologenic Klebsiella oxytoca P2 

In this example, the functional integration of both celYand celZ from E. 
chrysanthemi into the chromosome of K. oxytoca P2, is described. Also described is the 
synergism between recombinant EGY and EGZ and fungal cellulase (Spezyme CE®) 
during the fermentation of cellulose to ethanol using simultaneous saccharification and 
fermentation. 
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Throughout this example, the following materials and methods are used unless 
otherwise stated. 

Materials and Methods 

Bacteria, plasmids and culture conditions 

Strains and plasmids used in this example are listed in Table 13 below. 

TABLE 13. Strains and Plasmids 



Strains / plasmids 


Description 


Sources / references 


E. coli strains 






DH5a 


lacZ Ml 5 recA 


Bethesda Research 
Laboratory 


TOPI OF' 


hsdR mcrA /acZAMIS endA recA; F' tet lad 


lnvitrogen 


HB101 


recA lacY 


ATCC37159 


S17-1 


thipro recA hsdR RP4-2-tet::Mu aphA.Tnl Xpir 


De Lorenzo, et al. 
172:6568-6572. 


Z. mobilis strain 






CP4 


Prototrophic 


Ingram, et al (1999) 
Biotechnol Prog, 

l3.0_>J-000. 


K. oxytoca strains 






M5A1 


Prototrophic 


Wood, etal (1992) 
Appl. Environ. 
Microbiol. 58:2103- 
2110. 


rz 


pflr.pdc adhB cat 


WOOO, etal. yiyyZ) 
Appl Environ. 
Microbiol. 58:2103- 
2110. 


SZ6 


pfl- 


:pdc adhB cat; integrated celZ tet 




SZ12 


pfl- 


:pdc adhB cat; integrated celZ celY kan 


See text 


SZ21 


pfi. 


:pdc adhB cat; integrated celZcelY 


See text 


SZ22 


pfl 


:pdc adhB cat; integrated celY celZr.aac 


See text 


Plasmids 






pUC18 


bla cloning vector 


New England Biolabs 


pUC19 


bla cloning vector 


New England Biolabs 


pCR2J-TOPO 


TA cloning vector, bla kan 


lnvitrogen 


pMH18 


bla celY from E. chrysanthemi 3937 


Guiseppi, etal (1991) 
Gene 106:109-114. 


pHPQ45aaac 


bla aac source of apramycin gene 


Blondelet-Rouault, et 
al (1997) Gene 
190:315-317. 


pBR322 


bla tet cloning vector 


New England Biolabs 


pRK2013 


kan, mobilizing plasm id 


ATCC 
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TABLE 13. Strains and Plasmids (Continued) 



pCPP2006 


spm, ca. 40 kbp fragment containing out genes from E. 
chrysanthemi EC 1 6 


He, etal (1991) Proc. 
Natl. Acad. Sci. USA 
88:1079-1083. 


nFT-A 


and temperature conditional pSClOl replicon 


IVldl HllCZi _ lVlUl alCo, et 

al (1999)./ BacterioL 
181:7143-7148. 


DLOI2224 


hriYi intftrratinn vprtnr prints in in q prvnHitirvnal R^T£ 

rVL4f l IIlLV^gl ClllUll V ^ W IU1 V.J1 1 Id 1 1 1 1 1 1^, 11U11CU I\UJ\ 

replicon and two FRT sites 


A/fnT+irif»7-A/Tor5ilpQ 

a/. (1999) J. BacterioL 
181:7143-7148. 


pLOI2307 


bla containing celZ gene and a surrogate promoter 
from Z mob i lis DNA 


Zhou, etf a/. (1999) 

A 11"" 

Appl. Environ. 
Microbiol 65:2439- 
2445. 


nLOI23 1 1 


T*(^R fraprnftnt cftntairnnp" /'v/V' ctptip rlntif^H ititn 

A \ ' l V 11 ucllllvllL wUlllCllllllltl t-- L J 2^ W 1 L v vlv/llvU 111 LV_/ 

pCR2.1-TOPO, expressed from /ac promoter 


7hou et al f2000^ / 
BacterioL 182:5676- 
5682. 


pLOI2302 


pUC19 containing Ascl linkers inserted into blunt Ndel 
and Sapl sites 


Zhou, (1999; J. 
Indust. Microbiol. 
Biotechnol. 22:600- 
607. 


pLOI2316 


pUC 18 containing the ce/Fgene on a Klenow-treated 
EcoRl fragment from pLOI23 1 1 inserted into a blunt 
Hindi site, expressed from the lac promoter 


See text 


pLOI2317 


EcoRl-Hindlll fragment from pLOI2316 inserted into 
the corresponding sites of pLOI2302 


See text 


pLOI2318 


Sau3Al fragment of Z mobilis DNA fragment 
exhibiting promoter activity inserted into the BamHl 
site ofpLOI2317 


See text 


pLOI2319 


Saw3Al fragment of Z mobilis DNA exhibiting 
promoter activity inserted into the BamHl site of 
pLOI2317 


See text 


pLOI2320 


iSaw3Al fragment of Z mobilis DNA exhibiting 
promoter activity inserted into the BamHl site of 
pLOI2317 


See text 


pLOI2323 


Sau3Al fragment of Z mobilis DNA exhibiting 
promoter activity inserted into the BamHl site of 
pLOI2317 


See text 


pLOI2342 


Saw3Al fragment of Z mobilis DNA exhibiting 
promoter activity inserted into the BamHl site of 
pLOI2317 


See text 


pLOI2348 


Random EcoRl fragment of K. oxytoca MSA 1 DNA 
cloned into EcoRl site of pLOI2323 


See text 


pLOI2349 


EcoRl linker inserted into the Klenow-treated Sphl 
site ofpLOI2307 


See text 


pLOI2350 


EcoRl fragment (celZ and surrogate promoter) from 
pLOI2349 inserted into the EcoRl site of pLOI2224 


See text 


pLOI2352 


Ascl fragment (K. oxytoca fragment, Z mobilis 
promoter fragment and celY) from pLOI2348 inserted 
into the Ascl site of pLOI2350 


See text 
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TABLE 13. Strains and Plasmids (Continued) 



pLOI2353 


EcoRl-Aval fragment (Jet gene) from pBR322 inserted 
into the Cla\ site of pFT-A. 


See text 


pLOI2354 


pUC19 derivative in which the multiple cloning sites 
from Hindlll to Smal were deleted by digestion, 
Klenow-treatment, and self-ligation 


See text 


pLOI2355 


EcoRl fragment (ce/Z gene) from pLOI2349 inserted 
into the EcoRl site of pLOI2354. 


See text 


pLOI2356 


S/wal fragment containing the apramycin resistance 
gene (aac gene) from pHPf245aac inserted into the T4 
polymerase-treated Pstl site of pLOI2355, disrupting 
the celZ gene 


See text 


pLOI2357 


EcoRl fragment {aac and disrupted celZ ) inserted into 
the EcoRl site of pLOI2224 


See text 


pLOI2358 


Subclone of pLOI2323 in which the internal Pstl 
fragment was deleted, used for sequencing 


See text 


pLOI2359 


Subclone of pLOI2323 in which the Clal-Hindlll 
fragment was deleted, used for sequencing 


See text 



Escherichia coli DH5a and TOPO10F' were used as hosts during plasmid 
constructions. The celZ gene, ce/Fgene, and out genes were cloned as described in 
Example 3. 

E. coli cultures were grown at 37°C in Luria-Bertani broth (LB) containing per 
liter: 10 g Difco® tryptone, 5 g Difco® yeast extract, and 5 g sodium chloride or on solid 
LB medium containing agar (1 .5 %). Sugar was always included in broth (5% glucose 
or sorbitol) and solid media (2% glucose) used for the growth of ethanologenic strains. 
Clones were screened for endoglucanase production using the Congo Red method 
(Wood et at (1988) Methods Enzymology 160:87-1 12). Endoglucanse indicator plates 
were prepared by supplementing LB agar with 0.3% low viscosity carboxy methyl 
cellulose (CMC). Ampicillin (50 ^tg/ml), apramycin (100 jig/ml), kanamycin (50 
(ig/ml), chloramphenicol (40 (ag/ml) and spectinomycin (100 |ug/ml) were used for 
selection. Ethanologenic strains of K. oxytoca were maintained at 30°C on solid LB 
medium containing glucose (2%) and chloramphenicol (600 jag/ml). 

Genetic methods 

Standard methods were used for plasmid construction, analyses, and sequencing. 
The ribosome-binding site and promoterless coding region of celY were amplified by the 
polymerase chain reaction using pMH18 as the template with the following primer pairs: 
N-terminus 5'CTGTTCCGTTACCAACAC3' (SEQ ID NO: 13), C-terminus 
5'GTGAATGGGATCACGAGT3' (SEQ ID NO: 14). The E. chrysanthemi out genes 
(pCPP2006) were transferred by conjugation using pRK2013 for mobilization. 
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Constructions were confirmed by sequencing using the dideoxy method and a LI-COR 
Model 4000-L DNA sequencer with fluorescent primers. The E. chrysanthemi celY and 
celZ genes were introduced into K. oxytoca P2 by electroporation using a Bio-Rad Gene 
Pulser®. Recombinants were selected on solid medium containing kanamycin (50 
5 mg/liter) as described in Martinez-Morales, et al (1999) J. Bacteriology 181 :7143-7148, 
and Zhou, et al (1999) Indust Microbiol Biotechnol 22:600-607. 

Primer extension analysis 

Promoter regions were identified by mapping the transcriptional start sites using 
10 a IRD41 -labeled primer fluorescent primers within the coding regions: 5 f - 

ACC ATC AGC ATC AACGCCC AAC AACG-3 ' (SEQ ID NO: 15) for celY and 5'- 
GACTGGATGGTTATCCGAATAAGAGAGAGG-3' (SEQ ID NO: 16) for celZ. 
Extension products were dissolved in loading buffer and compared to parallel dideoxy 
sequences using the LI-COR Model 4000-L DNA sequencer (Lincoln, Nebraska). 

15 

Enzyme assay 

Endoglucanase activity was determined as described in Example 3. 
Fermentation 

20 Simultaneous saccharification and fermentation (SSF) tests were conducted in 

unbaffled, 500-ml flasks containing a 200 ml of broth. Flasks were fitted with a rubber 
stopper and vented with an 1 8 gauge needle. Fermentations were conducted at 35°C 
(120 rpm) in LB medium containing 10% Sigmacell 50 (crystalline cellulose). Inocula 
were grown for 12 hours in LB containing 5% glucose. Cells were harvested by 

25 centrifiigation and resuspended in LB. Each flask was inoculated to provide an initial 
density of 16 mg of cells (dry weight). 

Materials and chemicals 

Tryptone and yeast extract were products of Difco (Detroit, Michigan). 
30 Antibiotics, low viscosity CMC, and Sigmacell 50® were obtained from the Sigma 

Chemical Company (St. Louis, Missouri). The IRD41 -labeled fluorescent primers were 
purchased from LI-COR, Inc. (Lincoln, Nebraska). 

Construction of a promoter-probe vector for celY 

35 The gene from E. chrysanthemi is poorly expressed from its native 

promoter in E. coli (See Guiseppi, et al (1991) Gene 106:109-1 14). Accordingly, to 
increase expression, a promoter-probe vector was constructed as follows using celY as 
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the reporter (See Figure 13). A promoterless celY coding region with ribosomal-binding 
site (1 .2 kbp) was amplified by PCR using pMHl 8 as the template and randomly 
inserted into the topoisomerase vector, PCR2.1-TOPO. Functional expression of celY 
was confirmed using endoglucanase indicator plates. A clone oriented to express celY 
5 from the lac promoter was selected and designated pLOI23 11 (5.2 kbp). An EcoRl 

fragment containing the promoterless celY gene was isolated from pLOI23 1 1 . The ends 
of this fragment were blunted using Klenow polymerase prior to ligation into the Hindi 
site of pUC18. A clone oriented to express celY from the lac promoter was selected (3.9 
kbp) and expression confirmed using endoglucanase indicator plates (pLOI2316). The 

1 0 promoterless celY gene was isolated from pLOI23 1 6 as a 1 .2 kbp fragment using EcoRI 
and HindUl and inserted into the corresponding sites of pLOI2302 (pUC19 derivative) 
to reverse the direction of the gene. As expected, the resulting construct 
DH5a(pLOI2317) was inactive on endoglucanase indicator plates due to the lack of a 
promoter. To facilitate the insertion of DNA fragments containing promoter regions, 

1 5 plasmid pLOI23 1 7 (3 .9 kbp) contains a BamHl site in the polylinker region, 
immediately upstream from the celY gene (See Figure 13). 

Construction of plasmids with increased expression ofcelYin E. coliDHSa 

Sau3A\ fragments of Z mobilis chromosomal DNA were used to provide a 
20 heterologous promoter that would not be subject to native regulatory mechanisms in K. 
oxytoca or interfere with subsequent integration into the K. oxytoca chromosome (Zhou, 
etal (1999) J. IndusL Microbiol Biotechnol 22:600-607).Fragments of 0.5-1. 5 kbp 
were isolated and randomly ligated into the BamHl site of pLOI2317 to generate a 
library of surrogate promoters (See Figure 13). Approximately 75,000 colonies were 
25 screened on endoglucanase indicator plates. One-third of the clones actively produced 
celY. The most active 100 colonies were identified by zone size, purified, and re-tested. 
The 30 clones with the largest zones of activity were grown overnight in LB and assayed 
for CMCase activity. The five most active are listed in Table 14 below, and exhibited 
approximately 7-fold higher activity than the original clone, pMH18. Plasmid 
30 pLOI2323 was selected for further investigation. 
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TABLE 14. Expression of ce/Yin DH5a using fragments using Sau3A1 digestion 
products of Z. mobilis chromosomal DNA as surrogate promoters. 



Plasmids expressing 
celY or celZ 


Endoglucanase activity (IU/L) 


Extracellular 


Total 


% Extracellular 


pMH18 

(native celY promoter) 


151 


184 


82 


pLOI2317 

(promoterless celY vector) 


0 


0 


0 


celY expressed from surrogate promoters 


pLOI2318 


1,123 


1,257 


89 


pLOI2319 


888 


1,023 


87 


pLOI2320 


1,023 


1,056 


97 


pLOI2323 


1,257 


1,291 


97 


pLOI2342 


1,224 


1,257 


97 


pLOI2349 (celZ) 


3,414 


16,234 


21 



5 All plasmids are pUC derivatives. Endoglucanase activity was measured using cultures grown at 
37°C for approximately 16 hours. 



The Z mobilis Sau3Al fragment (937 bp) in pLOI2323 was sequenced in both 
directions (GenBank Accession No. AF305919). Based on a database comparison, this 

10 fragment appears to be derived from two pieces, a 882 bp fragment form Z. mobilis 

chromosome which corresponds to a previously sequenced region and a 55 bp fragment 
from the vector. A BLAST search (National Center for Biotechnology Information; 
http://www.ncbi.nlm.nih.gov/BLAST/ ) of the translated sequence did not reveal identity 
to known genes. Four sites of transcriptional initiation were identified in DH5a 

15 (pLOI2323) by primer extension analysis involving three different sigma factors, 8 , 
5 38 , and 8 70 (See Figure 14). Although the differences in intensity were less than 2-fold, 

32 

the sequence upstream from the most intense start site resembled the consensus for 8 
(rpoH), the heat shock promoter (Wang, et al. (1998) J. Bacteriology 180:5626-5631; 
Wise, etal (1996) J Bacteriology 178:2785-2793). 

20 

Construction of a vector for the integration of celY and celZ into the chromosome of 
K. oxytoca P2 

The plasmid pLOI2307 (7.2 kbp) was constructed and used to express celZ from 
a surrogate Z mobilis promoter at high levels in recombinant E. coli DH5a (See Zhou, 
25 et al (1999) B.Appl Environ. Microbiol 65:2439-2445) and K oxytoca M5A1 (See 

Zhou, et al (1999) Indust. Microbiol Biotechnol 22:600-607). To facilitate subcloning 
of this hybrid celZ gene and promoter (4.5 kbp), an EcoKl linker was inserted into the 
T4 polymerase-treated Sphl site of pLOI2307 to provide flanking EcoKL sites for 
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convenient excision (pLOI2349). Prior to constructing a plasmid containing celY and 
celZ, a random 3 kbp fragment of £coRI-digested K. oxytoca M5A1 chromosomal DNA 
was inserted into pLOI2323 containing celY (and surrogate promoter) to serve as a guide 
for homologous recombination (pLOI2348; 8 kbp). This 3 kbp M5A1 fragment was 
partially sequenced and appears to encode the complete M5A1 glgP gene. In pLOI2348 
(8 kbp), flanking Ascl sites allowed the excision of a single 5.5 kbp fragment containing 
the M5A1 glgP gene, Z mobilis surrogate promoter, and E. chrysanthemi celY. 

Figure 15 summarizes the construction of the celY, celZ integration vector from 
pLOI2349, pLOI2224, and pLOI2348. The recombinant celYand celZ genes containing 
surrogate promoters and the guide fragment were sequentially inserted into the core 
integration vector, pLOI2224 (Martinez-Moralez, et al, supra) using E.coli SI 7-1 as the 
host, to produce pLOI2352 (12 kbp). The 4.5 kbp EcoRI fragment from pLOI2349 
containing celZ was inserted into pLOI2224 using an EcoRI site to make pLOI2350 (6.4 
kbp). The 5.5 kbp Ascl fragment from pLOI2348 containing c^/fwas inserted into the 
Ascl site of pLOI2350 to make pLOI2352 (12 kbp). The fragments containing eel genes 
were oriented such that expression from the surrogate promoters was divergent. The 
resulting vector contained a R6K replicon that does not function in DH5a or M5A1. 
The two FRT sites in pLOI2352 facilitate removal of the kanamycin gene and replicon 
after integration (Martinez-Moralez, et ah, supra). 

Functional integration of celY and celZ into the K. oxytoca P2 chromosome 

Plasmid pLOI2352 was introduced into P2 by electroporation followed by 
selection for kanamycin resistance. Approximately 150 colonies were recovered and all 
were positive on endoglucanase indicator plates. Ten clones with the largest zones of 
activity were purified, grown in broth and assayed for endoglucanase activity. These 
produced 5-6 IU/ml of endoglucanase activity. One clone was selected for further study 
and designated as SZ12. 

Due to the natural resistance of K. oxytoca to ampicillin, an additional antibiotic 
resistance marker (tet) was added to pFT-A plasmid containing the flp recombinase to 
facilitate selection. The tetracycline gene was isolated as a 1 .4 kbp EcoRI to Aval 
fragment from pBR322. After treatment with Klenow polymerase, this fragment was 
ligated into the Klenow-treated Clal site of pFT-A to produce pLOI2353 (7.0 kbp). This 
plasmid encodes resistance to both ampicillin and tetracycline, the FLP recombinase 
(flp) under the control of the tetracycline promoter, and a temperature-conditional 
pSClOl replicon. 

Plasmid pLOI2353 was transformed into SZ12 and plated at 30°C with selection 
for tetracyline resistance. The presence of tetracycline also induced flp expression 
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resulting in a deletion of the kanamycin gene and R6K replicon from chromosomally 
integrated pLOI2353. Of 307 tetracycline-resistant colonies tested, >99% retained 
expression of the endoglucanase genes and were sensitive to kanamycin. Clones were 
purified, grown in broth and assayed for endoglucanase activity. All were similar and 
one was designated SZ21(pLOI2353). The helper plasmid was eliminated from SZ21 
by overnight growth at 37°C. 

Construction of a celZ knockout mutation 

To confirm the presence of functional celYm SZ21, a knockout mutation of the 
chromosomally integrated celZ was constructed by double, homologous recombination 
using plasmid pLOI2357 (Figure 16). Plasmid, pUC19 was digested with Smal and 
Hindlll, treated with Klenow polymerase, and self-ligated to eliminate many of the 
polylinker sites (pLOI2354). The remaining EcoRl site was used to insert a 4.5 kbp 
EcoRl fragment containing the promoter and celZ gene from pLOI2349 to make 
pLOI2355 (7.2 kbp). The 1 .8 kbp Smal fragment from pHPQ45aac containing the 
apramycin resistance gene (aac) was then ligated into the central region of celZ, 
replacing a small internal Pstl fragment (after blunting with T4 polymerase) to produce 
pLOI2356 (9.0 kbp). The 6.3 kbp EcoRI fragment from this plasmid was isolated and 
inserted into the core integration vector, pLOI2224, to produce pLOI2357 (8.2 kbp). 
This plasmid contains a conditional R6K replicon and kanamycin resistance gene in 
addition to a celZ gene that is interrupted by an apramycin resistance gene. 

Plasmid pLOI2357 was electroporated into SZ21 with selection for apramycin. 
Approximately 10% of the recombinants were apramycin resistant and kanamycin 
sensitive indicating a double homologous recombination event. These clones exhibited 
low levels of endoglucanase production on indicator plates (See Table 15, below). One 
was selected and designated SZ22. Loss of EGZ and retention of EGY in SZ22 was 
confirmed by SDS-PAGE using the Pharmacia Phast Gel system. 

It is interesting to note that cell clumping in liquid culture, typical of M5A1 and 
P2, was eliminated by the functional expression of ce/Zfrom integrated genes or from 
plasmids. Clumping was not affected by the functional expression of celY alone. 

Transcriptional initiation in K. oxytoca SZ21 

Primer extension analysis of celYond celZ in SZ21 were similar to those 
observed in DH5a. A single major transcriptional start was identified for ce/Zthat 
corresponded precisely to the most prominent start site in DH5a (pLOI2183) which 
contains the same promoter fragment (Zhou, et al (1999) J. Indus t. Microbiol 
Biotechnol. 22:600-607; Zhou, et al (1999) B. Appl Environ. Microbiol 65:2439- 
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2445). DNA immediately upstream from this site resembles the recognition sequence 
for a a promoter (Wang, et al, supra and Wise, et al, supra). As observed with DH5a 
(pLOI2323) (See Figure 14), primer extension analysis of celY indicated the presence of 
multiple putative transcriptional starts in SZ21 . Although localized in the same regions 
5 as the start sites in DH5a(pLOI2323), all bands were of near equal intensities. 

Effect of the E. chtysanthemi out genes (pCPP2006) on the extracellular secretion of 
EGY and EGZ in derivatives of K. oxytoca P2 

Table 7 summarizes the endoglucanase activities exhibited by cellulolytic 

10 derivatives of ethanologenic K oxytoca P2. Strain SZ6 (Zhou, et al., (1999) J. Indus t. 
Microbiol. Biotechnol 22:600-607) contains a chromosomally integrated hybrid celZ 
gene with same promoter fragment used to construct SZ2 1 . Despite the presence of two 
endoglucanase genes in SZ21, extracellular and total endoglucanase activities were 13% 
lower in this strain than in SZ6. Most of the endoglucanase activity produced by SZ21 

1 5 can attributed to celZ. SZ22, a celZ mutant of SZ2 1 , expressed only 1 1 % of the 

endoglucanase produced by the parent containing functional celY and celZ genes. In 
strains SZ6 and SZ21 containing a functional ce/Z, most of the endoglucanase activity 
(primarily EGZ) was cell associated. In strain SZ22 containing a functional celY alone, 
half of the endoglucanase activity was extracellular. 

20 

TABLE 15. Effect of out genes (pCPP2006) on endoglucanase production by derivatives 
of K. oxytoca P2, 



Strains 


CMC zone 
(mm) 


OD 550 


CMCase Activity (IU/L) a 


Extracellular 


Total 


Secretion (%) 


P2 


0 


10.5 


0 


0 


0 


SZ6 


8.5 


11.0 


1,920 


8,800 


22 


SZ21 


6.7 


11.0 


1,620 


7,800 


21 


SZ22 


2.0 


10.0 


480 


879 


55 


P2 (pCPP2006) 


0 


10.0 


0 


0 


0 


SZ6 (pCPP2006) 


10.8 


9.6 


13,800 


22,300 


62 


SZ21 (pCPP2006) 


11.5 


10.2 


20,100 


26,900 


75 


Spezyme CE^ (10 ml/liter) b 








27,000 




Spezyme CP^ (10 ml/liter) b 








33,400 





25 a Endoglucanase activity was measured using cultures grown at 30°C in LB containing 5% sorbitol for 24 
h. 

Dilution equivalent to the highest Spezyme level used in fermentation experiments (Table 4). 
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The addition of the out genes (pCPP2006) to recombinant E. coli and K. oxytoca 
M5A1 harboring celZ can cause a dramatic increase in the functional expression of celZ 
and in the fraction of EGZ secreted into the extracellular milieu (Zhou, et ah, (1999) J. 
Indust Microbiol Biotechnol 22:600-607; Zhou, et al (1999) B. Appl Environ. 
5 Microbiol 65:2439-2445). These same effects were observed for ethanologenic K. 
oxytoca SZ21 containing celZ and celY (See Table 14). Addition of the out genes to 
SZ22 (inactive celZ) had no effect on the functional expression of celY or the extent of 
EGY secretion. This celZ mutant, SZ22 (pCPP2006) ? produced only 3% of the total 
endoglucanase activity (EGY) produced by SZ21 (pCPP2006) containing functional 

10 celY and celZ genes. The secreted endoglucanase produced by SZ21 with the out genes 
was substantially higher than the sum of the individual activities expressed from the 
same respective promoters in SZ6 (EGZ) and SZ22 (EGY), consistent with synergism 
between these two enzymes. In this assay, synergy is estimated to be 1 .4-fold the 
arithmetic sum of the individual activities (SZ6(pCPP2006) and SZ22(pCPP2006)) for 

15 the combination of extracellular enzymes produced by SZ21(pCPP2006). 

Synergism between recombinant E. chrysanthemi endoglucanase (EGZ and EGY) 
and fungal cellulase (Spezyme CE®) during the fermentation of cellulose to ethanol 

SSF tests in flasks were used without pH control to evaluate the combined 
20 effects of fungal cellulase (Spezyme®) and cellulase enzymes produced by the 

biocatalysts on ethanol production from Sigmacell 50®, a highly crystalline substrate. 
Results are indicated in Table 16, below. 



TABLE 16. Maximum ethanol production 

25 



Strain 


Genencor Spezyme® 


Fermentation 3 - 1 


Type 


Addition 
(ml/liter) 


N 


Ethanol ± SD 
(g/liter) b) 


% Control ± SD 


P2 (pCPP2006) 


none 


0 


3 


0.23 ±0.01 


100 ±2.0 


SZ6(pCPP2006) 


none 


0 


3 


0.28 ± 0.02 * 


124 ±8.5 


SZ21(pCPP2006) 


none 


0 


3 


0.26 ± 0.02 * 


116 ± 8.5 


SZ22(pCPP2006) 


none 


0 


3 


0.24 ±0.01 


107 ± 1.0 


P2(pCPP2006) 


CE 


5.0 


6 


13.7 ±0.3 


100 ±2.0 


SZ6(pCPP2006) 


CE 


5.0 


6 


13.8 ±0.3 


101 ±2.4 


SZ21(pCPP2006) 


CE 


5.0 


6 


16.0 ±0.5 ** 


117 ± 3.5 


SZ22(pCPP2006) 


CE 


5.0 


6 


15.2 ±0.3 ** 


112 ± 1.5 


P2(pCPP2006) 


CE 


10.0 


6 


20.7 ±0.5 


100 ±2.1 


SZ6(pCPP2006) 


CE 


10.0 


6 


21.2±0.1 


103.4 ±0.4 
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TABLE 16. Maximum ethanol production (Continued) 



SZ2I(pCPP2006) 


CE 


10.0 


6 


24.6 ±0.5 ** 


121 ±2.3 


SZ22(pCPP2006) 


CE 


10.0 


6 


25.2 ± 1.1 ** 


122 ± 5.0 


P2(pCPP2006) 


CP 


5.0 


6 


15.2 ±0.3 


100 ± 1.7 


SZ2I(pCPP2006) 


CP 


5.0 


6 


17.8 ±0.4** 


116 ±2.2 


P2(pCPP2006) 


CP 


10.0 


6 


25.3 ± 0.7 


100 ±2.6 


SZ21(pCPP2006) 


CP 


10.0 


6 


27.2 ± 0.3 ** 


107 ± 1.2 



10 



15 



20 



25 



30 



35 



a> Cultures without added cellulose (100 g/liter) or Spezyme produced 0.22 ± 0.01 g/liter of 
ethanol. Spezyme contained approximately 100 FPU/ml. Addition of 5 ml and 10 ml of 

Spezyme corresponds to 5 FPU/g and 10 FPU/g cellulose, respectively. 
b) Student t-test shows that there is significant difference in ethanol production compared to the respective 
P2 controls at each Spezyme dilution. P value <0.001 is indicated by a two asterisks. 
P value <0.05 is indicated by a single asterisk. 

Although very low levels of ethanol were produced by all strains in the absence 
of Spezyme®, strains SZ6(pCPP2006) and SZ21(pCPP2006) containing functional cell 
genes produced higher levels of ethanol (p<0.05) than strain SZ22(pCPP2006) 
containing only a functional celY gene and strain P2(pCPP2006) lacking endoglucanase 
genes. In the absence of both Spezyme® and Sigmacell 50®, all strains produced 0.22 
g/L ethanol. The additional increment of ethanol produced by SZ6(pCPP2006) and 
SZ21(pCPP2006) during incubation with Sigmacell 50 is attributed to hydrolysis of the 
small fraction of amorphous cellulose in the substrate by EGZ (Zhou, et al (1999) J. of 
Industrial Microbiol Biotechnol. 22:600-607; Zhou, et al (1999) B. Appl Environ. 
Microbiol 65:2439-2445; Zhou, S., et al (2000)/. Bacteriol 182:5676-5682). 
Digestion of amorphous cellulose by EGY produces saccharides that are too large to be 
transported and metabolized without further hydrolysis (Zhou, S., et al (2000) J. 
Bacteriol 182:5676-5682). 

Spezyme CE® and Spezyme CP® contain a commercially optimized combination 
of endoglucanase, exoglucanase, and cellobiase activities (Beguin, et al (1994) FEMS 
Microbiol Rev. 13:25-58; Boyer, et al (1987) Eur. J. Biochem. 162:31 1-316; Nieves, et 
al (1998) World Journal of Microbiology and Biotechnology 14:310-314; Ohmiya, et 
al (1997) Biotechnol Genetic Eng. Rev. 14:365-414). Despite this optimization, 
Spezyme®-supplemented fermentations with two of the endoglucanase producing 
biocatalysts, SZ21(pCPP2006) and SZ22(pCPP2006), produced significantly higher 
levels of ethanol than the control P2(pCPP2006) which lacks endoglucanase genes. The 
combinations of Spezyme® and SZ21(pCPP2006) and SZ22(pCPP2006) were 
synergistic in terms of ethanol production, up to 20% higher than the sum of ethanol 
produced by each individually (p<.005). Synergy was observed for both dilutions of 
Spezyme CE®, and for Spezyme CP®. This synergistic effect can be attributed primarily 
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to EGY since this is the only endoglucanase produced by SZ22(pCPP2006). No 
synergy was observed for SZ6(pCPP2006) which produces only EGZ. 

EXAMPLE 5 

5 Simultaneous Saccharification and Fermentation of Amorphous Cellulose to 

Ethanol by Recombinant Klebsiella oxytoca SZ21 without Supplemental Cellulase 

In this example, a derivative of Klebsiella oxytoca M5A1 containing 
chromosomally integrated genes for ethanol production from Zymomonas mobilis (pdc, 
adhE) and endoglucanase genes from Erwinia chrysanthemi {celY } celZ) is demonstrated 
10 to produced over 20,000 U/L of endoglucanase activity during fermentation. In 
particular, this strain is demonstrated to, in combination with its native ability to 
metabolize cellobiose and cellotriose, ferment amorphous cellulose to ethanol (58-76% 
of theoretical yield) without the addition of cellulase enzymes from other organisms. 

Throughout this example, the following materials and methods are used unless 
py 15 otherwise stated. 

.3 

Materials and Methods 

O Bacteria, plasmids and culture conditions 

?\ Four ethanologenic derivatives of Klebsiella oxytoca M5 Al were used in this 

y3 20 study. Strain P2 contains chromosomally integrated pdc and adhB genes from Z mobilis 

J for ethanol production (Wood et a/., (1992) Appl Environ, Microbiol 58: 2103-21 10). 

This strain was the parent organism for three strains that contain highly expressed, 
chromosomally integrated endoglucanase genes from E. chrysanthemi (Zhou et al. 
(2001) Appl Environ. Microbiol. 67: 6-14). Strains SZ6, SZ22, and SZ21 contain celZ 
25 alone, celY alone, and both ce/7and celZ, respectively. Additional genes 

(approximately 1 5 out genes) were required for the efficient secretion of endoglucanase 
CelZ (He et al (1991) Proc. Natl Acad. ScL USA 88:1079-1083; Pugsley et al, (1997) 
Gene 192: 13-19).and were supplied by plasmid pCPP2006. For consistency, this 
plasmid has been inserted into all strains. Recombinant K oxytoca strains were grown 
30 in Luria broth (per liter: 10 g yeast extract, 5 g tryptone, 5 g NaCl) containing a 

carbohydrate. Spectinomycin (100 mg/L) was included to maintain plasmid pCPP2006. 
Seed cultures (5% glucose, 12-16 h) and fermentations were incubated at 35°C (120 
rpm) and were vented using a 22 gauge needle. 



35 



Assay of endoglucanase activity 

Total endoglucanase activity (Cel Y and CelZ) was determined for cultures 
grown for 24 hours in Luria broth containing 5% sorbitol to minimize interference 
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during the determination of reducing sugars (Zhou et al. (2000) J. Bacteriol 1 82: 5676- 
5682). Cell suspensions in broth were briefly treated with ultrasound to release bound 
enzymes. Dilutions were assayed at 35°C in 50 mM sodium citrate buffer (pH 5.2) 
using carboxymethyl cellulose as the substrate (2%). Reactions were terminated by 
5 heating (10 min, 100°C). Activity was expressed as (imole per min of reducing sugar 
(U) using glucose as a standard. 

Thin layer chromatography (TLC) analysis of cellobiosides 

Cellobiosides were separated using Whatman LK5 silica gel plates (Cat. No. 
10 4855-620). Plates were developed in solvent (6 parts chloroform, 7 parts acetic acid, 
and 1 part water), air-dried, and developed a second time in the same solvent. 
Saccharides were visualized by spraying with 6.5 mM N-(l-naphthyl)ethylenediamine 
dihydrochloride and heating to 100°C for 10 min (Zhou et al. supra). 



1 H 1 5 Preparation of cellobiosides from microcrystalline cellulose 

Cellulose was converted to cellobiosides based on the method of Pereira et al 
(1988). Sigmacell type 50 cellulose (100 g; Cat. No. S-5504) was slowly added to 400 g 
of 72% H 2 S0 4 (w/w) and allowed to hydrolyze at 22°C with stirring for 18 hr. The 
digested slurry was precipitated by slowly adding to 2.5 L of cold ethanol (100%), 
20 centrifiiged, and the pellet washed twice with 1 L of cold ethanol. The pellet was 

dissolved in 100 ml of H2O, adjusted to pH 6, and centrifiiged at 5000 x g for 20 min to 
remove insoluble fibers. The supernatant containing cellobiosides was dried and 
analyzed. 

25 HPLC analysis of cellobiosides 

Cellobiosides were analyze using a Waters HPLC equipped with a refractive 
index monitor and digital integrator. Separations were performed at 65 °C using a 
BioRad HPX-42A column with distilled water as the mobile phase (0.6 ml/min). 

30 Fermentation of cellobiosides 

Initial fermentations used cellobiosides purchased from the Sigma Chemical 
Company (Cat. No. C- 8071). Seed cultures (1 12.5 jiL) were typically combined with 
37.5 |iL of 4% cellobiosides in a 1 .5-ml microcentrifuge tube and incubated for 36 h 
(35°C and 120 rpm). Samples (50 ^iL each; 0 h, 10 h, and 36 h) were heated (100°C, 10 
35 min) to inactivate enzymes and centrifuged (10,000 x g, 5 min) to remove cell debris. 
Supernatants (4 |xL) were then spotted on TLC plates for analysis. 
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Small fermentations (50-ml flasks, 5 ml broth volume) were conducted in a 
similar fashion with laboratory samples of cellobiosides prepared by sulfuric acid 
hydrolysis. Cellobiosides (6 g/L total for saccharides less than 7 glucosyl residues) were 
filter sterilized in Luria broth. Flasks were inoculated with pelleted cells (5,000 x g, 10 
5 min) from a seed culture (dry weight of 0.33 g/L) to eliminate ethanol produced from 
the glucose in seed media. Samples were removed initially, and then at 24-h intervals 
for the determination of the presence of ethanol by gas chromatography (Beall et al 
(1991) Biotechnol Bioeng. 38: 296-303). 

10 Preparation of amorphous cellulose 

Phosphoric acid-swollen cellulose was prepared by a modification of the method 
described by Wood (1988). Approximately 20 g of Avicel (Cat. No. PH105, Fluka 
Chemika) was added slowly to 500 ml 85% H3PO4, stirred at room temperature 
overnight, poured into 4 L of ice cold water, and allowed to settle without agitation for 

15 30 min. After decanting the upper liquid layer, 4 L of cold water was added and mixed 
thoroughly, and repeated 5 times with water, once with 1 % NaHCCb , and 5 more times 
with water (final pH 6-7). The cellulose suspension was concentrated by centriflxgation 
(5000 x g, 20 min) and used as a substrate for fermentation. This viscous suspension 
contained a mixture of crystalline and amorphous cellulose. 

20 The fraction of amorphous cellulose present in the phosphoric acid-swollen 

cellulose was estimated by repeated digestion with CelY and CelZ endoglucanases. A 
24-h culture of SZ21(pCPP2006) was sonicated briefly, centrifuged (5,000 x g, 10 min), 
and the supernatant used as a source of endoglucanase (-20 U/ml). Approximately 1 g 
of the viscous, acid-swollen cellulose was weighed into each of 6 pre- weighed 

25 centrifuge tubes. Two served as controls (no enzyme) and were dried to determine 

initial dry weight and moisture content. Endoglucanase (9 ml) was added to 4 tubes and 
these were incubated at 35°C for 12 h. Chloroform (2 drops) was added to each tube to 
retard microbial growth. After centrifugation (5,000 x g, 20 min), this process was 
repeated with new enzyme preparations for a total of 6 successive treatments over a 72 h 

30 period. Tubes were removed at various times, centrifuged, washed once with distilled 
water to remove soluble products and salts, and dried to a constant weight at 70°C. 
Amorphous cellulose was calculated as the reduction in dry matter resulting from 
endoglucanase digestion. 

35 Fermentation of amorphous cellulose 

Acid swollen cellulose (40 g, unsterilized) was combined with 5 ml of a 10X 
concentrate of Luria broth and 5 ml of seed culture (cell dry weight of 0.33 mg/L) in a 
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125-ml flask. Both spectinomycin (100 |ig/ml) and chloramphenicol (40 ng/ml) were 
added to eliminate contamination. The viscous mixture was initially mixed using an 
applicator stick, then incubated at 35°C. Due to the high initial viscosity, flasks were 
mixed at 200 rpm during the first 2 h and subsequently, at 120 rpm. Ethanol was 
5 measured by gas chromatography (Beall et al (1991) Biotechnol Bioeng. 38:296-303). 

Viscosity 

The viscosity of acid-swollen cellulose preparations was estimated at 22°C by 
timing the flow through a vertical 10-ml glass pipette. Solutions of glycerol were used 
10 as standards. Flow times of 2 sec to 75 sec were observed for fermentation broths, 
corresponding to viscosities of 1 to 1,300 centipoise, respectively. 

Results 

Comparison of endoglucanase activities produced by ethanologenic derivatives ofK. 
1 5 oxytoca 

Ethanologenic strains containing endoglucanase genes from E. chrysanthemi 
produced substantial levels of activity during glucose fermentation. The highest activity 
was produced by strain SZ21(pCPP2006) containing both celZ and celY, 29.3 ±1.6 
U/ml of culture. Strain SZ6(pCPP2006) containing celZ alone produced 22.5 ±1.7 U/ml 
20 and strain SZ22(pCPP2006) containing celY ( celZ deletion of strain SZ2 1 ) produced 1 .0 
±0.1 U/ml. Approximately 60% of the activity was secreted by each strain, the balance 
being cell associated and readily released by mild sonication. No endoglucanase activity 
was detected in the parent, strain P2(pCPP2006). 

25 Analysis of cellobiosides 

The presence of cellobiosides was analyzed by thin layer chromatography and by 
HPLC. Preparations from the Sigma Chemical Company and those prepared in our 
laboratory were similar. Lane 1 of both thin layer chromatograms (Figure 17, panels A 
and B ) shows a representative separation of the Sigma product at the beginning of 

30 fermentation. Small amounts of glucose and cellobiose were present with intermediate 
amounts of cellotriose and cellohexaose, and larger amounts of cellotetraose and 
cellopentaose. The most intense region, however, remained at the origin. This region 
contains cellobiosides of greater than 6 residues and other uncharacterized compounds. 
HPLC analyses were in agreement with thin layer chromatograms. The Sigma 

35 product contained 3% glucose, 3% cellobiose, 13% cellotriose, 25% cellotetraose, 22% 
cellopentaose, and 9% cellohexaose. Based on peak areas, approximately 25% of the 
saccharides were longer than 6 residues. Cellobiosides prepared in our laboratory 
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contained 7% glucose, 6% cellobiose, 10% cellotriose, 15% cellotetraose, 12% 
cellopentaose, 19% cellohexaose, and 30% saccharides longer than 6 glucosyl residues. 

Fermentation of cellobiosides 
5 Figure 1 7 shows a thin layer chromatogram comparing the fermentation of 

cellobiosides (Sigma Chemical Company) by strain P2(pCPP2006) and by 
SZ21(pCPP2006), a derivative which secretes both endoglucanases CelY and CelZ. 
Utilization of glucose, cellobiose, and cellotriose by strain P2 was confirmed and is 
particularly evident after 36 h. However, this strain was unable to metabolize longer 

1 0 cellobiosides. In contrast, strain SZ2 1 degraded and metabolized virtually all of the 
separated cellobiosides and a portion of the material at the origin. It is relevant to note 
that after 10 h, the levels of cellobiose and cellotriose in the SZ21 fermentation were 
several-fold higher than initially present. This increase was attributed to the hydrolysis 
of longer cellobiosides by secreted CelY and CelZ. 

1 5 The effectiveness of secreted endoglucanases was also examined during ethanol 

production from laboratory cellobiosides (Table 17). Although only low levels of 
ethanol were produced due to the low substrate concentrations, the benefit of CelZ 
endoglucanase is clearly evident. The parent P2(pCPP2006) and strain SZ22(pCPP2006; 
CelY only) produced half as much ethanol as the two strains that secreted CelZ: 

20 SZ6(pCPP2006; CelZ only) and SZ21(pCPP2006; CelY and CelZ). Endoglucanase 

CelY was of no benefit using cellobiosides as substrates, consistent with the requirement 
for a long-chain substrate. 

Ethanol yields were estimated based on cellobiosides containing less than 7 
glucosyl residues (Table 17). An average of 62% of the theoretical yield was observed 

25 for the two strains producing endoglucanase CelZ. Higher yields (over 90%) were 

previously measured during the fermentation of 1 00 g/L cellobiose by strain P2 (Wood 
& Ingram 1992). It is likely that the lower yields observed with low concentrations of 
cellobiosides result in part from evaporative losses and the diversion of a larger fraction 
of carbon to cell growth. 

30 

Amorphous cellulose content of phosphoric acid swollen cellulose 

Acid swollen cellulose partially recrystallizes during the removal of acid and 
storage due to extensive hydrogen bonding between cellulose ribbons. E. chrysanthemi 
Cel Y and Cel Z hydrolyze amorphous cellulose and carboxymethyl cellulose but are 
35 unable to hydrolyze crystalline cellulose. This resistance of crystalline cellulose to 
endoglucanase hydrolysis was used to estimate the fraction of washed, acid-swollen 
cellulose which remained amorphous as the loss of insoluble material (dry weight). 
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Samples of amorphous cellulose were repeatedly incubated with fresh endoglucanase 
preparations (~20 U/ml). After an initial 12-h treatment, 28% of the dry weight was 
solubilized. After four successive treatments, 43% was solubilized and remained 
constant for the two final treatments. This value, 43%, represents an estimate of the 
5 fraction of cellulose that remained in the amorphous state. The actual fraction of 
amorphous cellulose may be somewhat lower since estimates also include losses of 
crystalline cellulose which may have occurred during centrifugation and washing. 

Fermentation of amorphous cellulose 

10 The functional expression of high levels of endoglucanase is sufficient to permit 

the direct conversion of amorphous cellulose to ethanol by two derivatives of K. oxytoca 
P2 (Table 17; Figure 18). Three concentrations of amorphous cellulose were tested with 
similar results. 

The highest levels were produced by strain SZ21(pCPP2006) which secretes a 
15 combination of CelY and CelZ endoglucanases. Lower levels of ethanol were produced 

by strain SZ6(pCPP2006) which secretes CelZ alone. A small amount of ethanol was 

produced from residual glucose in the seed media by the parent strain, P2(pCPP2006; 

no endoglucanase) and by strain SZ22(pCPP2006) which secreted only CelY. The 

combined effects of CelY and CelZ appears to be synergistic for ethanol production and 
20 ethanol yield despite the lack of efficacy of CelY alone. Ethanol yields for strain 

SZ21(pCPP2006; both enzymes) ranged from 58%-76% of the theoretical maximum. 

This synergistic effect on ethanol production may result from increased hydrolysis due 

to differences in substrate specificity. 

25 Viscosity changes during the fermentation of amorphous cellulose 

Pronounced viscosity changes were observed during the fermentation of 
amorphous cellulose by endoglucanase-secreting strains (Figure 18). The largest 
reduction in viscosity was evident during the first 2 h of incubation for SZ6(pCPP2006; 
CelZ alone) and SZ21(pCPP2006; CelY and CelZ), from a viscosity near that of pure 

30 glycerol to that near water. Less pronounced changes were observed with 

SZ22(pCPP2006; CelY alone) and no change was evident with the parent strain 
P2(pCPP2006) which served as a control. Changes in viscosity were essentially 
complete after 12 h. 

Differences in viscosity were quantified at the end of fermentation for 

35 Experiment 4 in Table 17 (Figure 18, panel C), These varied by three orders of 

magnitude. The highest final viscosity was observed for P2(pCPP2006) which does not 
produce endoglucanase, 1,300 centipoise. Viscosity was reduced by half (500 
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centipoise) during fermentation with SZ22(pCPP2006) as a result of hydrolysis by 
endoglucanase CelY alone. At the end of fermentation, the viscosities of broths from 
both strains that secrete CelZ were essentially equivalent to that of water, 1 centopoise 
for SZ21(pCPP2006; CelY and CelZ) and 2 centipoise for SZ6(pCPP2006; CelZ alone). 
Clearly, CelZ alone was more effective than CelY alone. No further benefit was 
attributed to the production of CelY in combination with CelZ when compared at the 
end of fermentation (96 h) although both enzymes function synergistically in the 
production of ethanol (Table 17; Figure 18) and during saccharification. 

In summary, these results using, e.g., K. oxytoca strain SZ21, demonstrate an 
advancement toward the goal of producing sufficient cellulase enzymes for the direct 
bioconversion of cellobiosides and amorphous cellulose to ethanol without the addition 
of supplemental enzymes. Endoglucanase levels produced by this strain are over 1 0- 
fold over those previously reported for engineered strains of yeast and other bacteria 
during ethanol fermentation (Brestic-Goachet et al 1989, Cho et al 1999, Cho & Yoo 
1999, Misawa et al 1988, Su et al 1993, Van Rensburg et al 1996, 1998). Moreover, 
the level of endoglucanase produced by strain SZ21 is roughly equivalent to 1% of the 
endoglucanase activity present in commercial cellulase concentrates (Nieves et al 1998, 
Tomme^fl/. 1995, Wilson et al 1997). 

The effectiveness of strain SZ21 in the hydrolysis of amorphous cellulose results 
from the combination of two endoglucanase enzymes E. chrysanthemi and function 
synergistically (Guiseppi et al 1991, Zhou & Ingram 2000). Moreover, these secreted 
enzymes function with the phosphotransferase system in K. oxytoca which provides 
efficient uptake and metabolism of cellobiose and cellotriose (Wood & Ingram 1 992, 
Lai etal 1997). 
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TABLE 17. Production of ethanol from cellobiosides and from amorphous cellulose 





bxpt. 


Strain 


Substrate 


Substrate 
(g/L) < a) 


XT 


Ethanol 
(g/iiter) 


0/ 
yo 

Theoretical 
Yield (c) 




1 


P2 (pCPP2006) 


cellobiosides^ 


6.0 


2 


1.06 


33 




1 


SZ6(pCPP2006) 


cellobiosides 


6.0 


2 


2.04 


63 




1 


SZ21(pCPP2006) 


cellobiosides 


6.0 


2 


2.02 


62 




1 


SZ22(pCPP2006) 


cellobiosides 


6.0 


2 


0.93 


26 




2 


P2(pCPP2006) 


amorphous cellulose 


6.85 


3 


1.77 ±0.06 


0 




2 


SZ6(pCPP2006) 


amorphous cellulose 


6.85 


3 


3.97 ±0.12** 


57 




2 


SZ21(pCPP2006) 


amorphous cellulose 


6.85 


3 


4.67 ±0.06** 


76 




2 


SZ22(pCPP2006) 


amorphous cellulose 


6.85 


3 


1.80 ±0.01 


0 




3 


P2 (pCPP2006) 


amorphous cellulose 


15.34 


3 


1.85 ±0.01 


0 




3 


SZ6(pCPP2006) 


amorphous cellulose 


15.34 


3 


6.07 ± 0.18** 


49 




3 


SZ21(pCPP2006) 


amorphous cellulose 


15.34 


3 


7.84 ±0.35** 


70 


S3". 

; : j 


3 


SZ22(pCPP2006) 


amorphous cellulose 


15.34 


3 


1.92 ±0.02 


0 


SI 3 




P2 (pCPP2006) 


amorphous cellulose 


28.96 


1 


1.90 


0 




4 


SZ6(pCPP2006) 


amorphous cellulose 


28.96 


1 


10.1** 


51 


•a 


4 


SZ21(pCPP2006) 


amorphous cellulose 


28.96 


1 


11.3** 


58 




4 


SZ22(pCPP2006) 


amorphous cellulose 


28.96 


1 


1.80 


0 



5 a Inocula for cellobioside fermentations were harvested by centrifugation to eliminate seed broth as a 

source of ethanol. An average of 1 .85 g/L ethanol was produced from residual sugar in the seed broth used 
to inoculate the three fermentations of amorphous cellulose. 

b ** Indicates that the value is significantly different from that of the control strain (P2) lacking genes 
encoding £. chrysanthemi endoglucanases (p<0.001). 
1 0 c Yields are calculated as a percentage of theoretical maxima, assumed to be 0.54 g ethanol per gram of 
cellobiose and 0.56 g ethanol per gram of amorphous cellulose. For amorphous cellulose, yields were 
computed after subtracting ethanol produced from sugar in the inocula. 

d Cellobiosides (g/L) represents the sum of glucose plus cellobiosides containing fewer than 7 glucosyl 
residues. 

15 e The highest coefficient of variation in Experiments No. 2 and No. 3 was 4.5% of the mean value. By 
assuming a similar coefficient of variation (5% of the measured value), significance was also estimated 
for Experiment No. 3. 
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Equivalents 

Those skilled in the art will recognize, or be able to ascertain using no more than 
routine experimentation, many equivalents to the specific embodiments of the invention 
described herein. Such equivalents are intended to be encompassed by the following 
claims. Moreover, any number of genetic constructs, host cells, and methods described 
in United States Patent Nos. 5,821,093; 5,482,846; 5,424,202; 5,028,539; 5,000,000; 
5,487,989, 5,554,520, and 5,162,516, may be employed in carrying out the present 
invention and are hereby incorporated by reference. 



